About the role
The Site Reliability teams at Datadog are responsible for keeping our high-volume, low-latency environments performing around the clock. These teams collaborate closely with our product engineers so that Datadog can monitor millions of servers and containers and our customers always have dependable and actionable data at their fingertips. You'll be responsible for shaping the infrastructure of our data-intensive, real-time services as we continue to grow at petabyte scale.
At Datadog, we value our office culture: the relationships it builds, the creativity it brings to the table, and the collaboration that comes from being together. We operate as a hybrid workplace so that our employees can create the work-life harmony that best fits them.
What You'll Do:
Keep our services reliable, available, fast and cost-efficient.
Respond to, investigate and fix service issues, whether they are deep in the OS kernel or in the application code.
Build tools and production frameworks to make our engineering team's lives easier.
Design, build and maintain the infrastructure we need to support orders of magnitude more customers.
Who You Are:
You have 5+ years of experience in software engineering
You value correctness and efficiency; you leave no stone unturned when diagnosing production issues
You handle infrastructure with code because automation lets you focus on the more difficult and rewarding problems
You have production experience with distributed compute/storage tools, e.g. Kubernetes, Cassandra, Postgres, Kafka, Elasticsearch, Redis
About the company
Datadog (NASDAQ: DDOG) is a global SaaS business, delivering a rare combination of growth and profitability. We are on a mission to break down silos and solve complexity in the cloud age by enabling digital transformation, cloud migration, and infrastructure monitoring of our customers' entire technology stacks. Built by engineers, for engineers, Datadog is used by organizations of all sizes across a wide range of industries. Together, we champion professional development, diversity of thought, innovation, and work excellence to empower continuous growth. Join the pack and become part of a collaborative, pragmatic, and thoughtful people-first community where we solve tough problems, take smart risks, and celebrate one another.