griddable.io · 11 hours ago
Software Engineering MTS
Griddable.io is seeking a highly motivated Member of Technical Staff (MTS) to help build, operate, and scale an intelligent AIOps product that supports Tier-0 and Tier-1 services across Salesforce. The role involves developing product capabilities for observability, automated detection, root cause analysis, and remediation for large scale distributed systems, while collaborating with cross-functional teams to ensure system reliability and operational efficiency.
AnalyticsBig DataCloud Data ServicesData IntegrationInformation TechnologySaaSSoftware
Responsibilities
Develop and maintain core Warden AIOps product services, including data ingestion, signal processing, detection pipelines, and reliability workflows. This includes working with logs, metrics, traces, and events at scale, ensuring high performance and resilience
Analyze operational data and system behaviors to identify anomalies, recurring failure patterns, and performance regressions across distributed services. Use structured analysis and tooling to ensure accurate and reliable insights
Contribute to detection and causation capabilities by implementing rule based and AI assisted mechanisms that improve incident detection accuracy and reduce mean time to detection (MTTD) and resolution (MTTR)
Identify and address reliability, scalability, and performance issues in product components. Proactively document technical findings, bugs, and improvement areas to maintain a high bar for production readiness
Collaborate with cross-functional teams including Service Owners, infrastructure, security, and partner product teams to integrate Warden AIOps with upstream and downstream systems
Participate in design reviews, code reviews, and operational readiness activities, ensuring adherence to Salesforce engineering standards, security requirements, and compliance expectations
Produce clear technical documentation and recommendations that translate complex system behavior into actionable guidance for service owners and product stakeholders
Qualification
Required
Bachelor's degree in Computer Science, Engineering, or equivalent practical experience
Strong proficiency in one or more backend programming languages (e.g., Java, Go, Python), with experience building and operating distributed systems
Understanding of cloud native architectures, microservices, and service-to-service communication patterns
Strong analytical and problem solving skills, with the ability to reason about complex system behaviors and failure modes
Effective written and verbal communication skills, with the ability to collaborate across teams and clearly document technical work
Preferred
Good to have, experience working with observability data such as metrics, logs, traces, or events, and familiarity with monitoring or reliability concepts
Company
griddable.io
Griddable.io is a San Jose, CA based SaaS startup that closed Series A funding in 2017 from August Capital, Artiman Ventures, and Carsten Thoma, founding CEO of Hybris (acquired by SAP).