About the role
We are looking for a Senior Software Engineer with a passion for platform engineering to join our Wire Team. This role has a strong focus on improving internal Developers' Experience (DX) tools and platforms that are integral to the success of our development process.
Initially, a portion of your time will be spent working closely with the Wire Team. This will help you familiarize yourself with CAST AI's engineering practices and gain a deep understanding of our product. In the Wire team, your main responsibility will be managing and improving the observability (o11y) stack. Long term, you will have the exciting opportunity to transition and play a key role as one of the founding members of our Tooling Team, where you will shape and influence our approach to DX tooling and automation at scale. In this position, you will:
Maintain and optimize the observability (o11y) stack: manage Prometheus, Grafana, Loki, Phlare, Tempo, and other relevant observability tools. Ensure our monitoring, alerting, and logging systems provide a frictionless way to define engineering team-related alerts.
Improve continuous Integration and Delivery: manage and optimize CI/CD pipelines using tools like GitLab Pipelines, GitHub Actions, ArgoCD, and Helm, ensuring efficient and reliable deployment processes.
Development environment management: enable other engineering teams by maintaining and extending the existing local development tooling managed by Tilt.
Oversee incident management systems: integrate with incident management and alerting tools such as Opsgenie, Pagerduty, or similar to enhance our response capabilities and reduce downtime.
About the company
CAST AI is the leading Kubernetes automation platform for AWS, GCP and Azure customers. The company is on a mission to deliver a fully automated Kubernetes experience. What's unique about CAST AI is that its platform goes beyond monitoring clusters and making recommendations; it utilizes advanced machine learning algorithms to analyze and automatically optimize clusters, saving customers 50% or more on their cloud spend, improving performance and reliability, and boosting DevOps and engineering productivity.
The company has raised $73M from investors, including Cota Capital, Creandum, Uncorrelated Ventures, and Vintage Investment Partners. CAST AI has nearly 200 employees globally and is headquartered in Miami, Florida.