About the role
We're looking for a Staff Software Engineer to join the Analytics Data Platform department and help scale our data processing platform, which enables hundreds of data practitioners and AI Engineers at Datadog to build, deploy, and operate both batch and stateful streaming data pipelines at scale.
The Analytics Data Platform organization provides a fully managed, integrated ecosystem that empowers teams across the company to easily create, manage, access, and use data. Our platform supports customer-facing products that rely on analytical data—such as Cloud Cost Management, Metrics, Security, Applied AI, and Product Analytics—as well as internal analytics use cases. Today, we run over 400,000 batch jobs per day, process millions of events per second in Flink, and power data governance and analytical workloads through our Lakehouse offering.
At Datadog, we place value in our office culture - the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them.
What You'll Do:
Work alongside teams of engineers and steer the design, implementation and deployment of scalable data processing infrastructure and tools (Flink, Spark, PySpark, Kubernetes).
Enable the productization of Datadog's AI effort by supporting Ray in a managed, scalable and isolated environment.
Collaborate closely with product engineering teams to ensure that the platform provides the best infrastructure for Datadog's products.
Contribute to open-source development of Flink and Spark to make sure they support Datadog's usage.
Identify scalability, reliability and efficiency risks and design solutions to address them, driving the growth of batch and streaming frameworks at Datadog.
Lead and support the team through mentorship, technical leadership, project prioritization and process improvements.
Who You Are:
You are a seasoned software engineer with extensive experience driving architecture and execution for cross-team engineering initiatives focused on distributed systems and stream processing applications
You care about reliability and efficiency, designing resilient and performant systems
You cut through ambiguity, route around blockers and act as a force multiplier for your team, enabling them to ship quickly and safely.
You have experience leading projects that span data engineering, data science and infrastructure.
You're happy to jump into any part of the stack and do whatever's needed to move a project forward
About the company
Datadog (NASDAQ: DDOG) is a global SaaS business, delivering a rare combination of growth and profitability. We are on a mission to break down silos and solve complexity in the cloud age by enabling digital transformation, cloud migration, and infrastructure monitoring of our customers' entire technology stacks. Built by engineers, for engineers, Datadog is used by organizations of all sizes across a wide range of industries. Together, we champion professional development, diversity of thought, innovation, and work excellence to empower continuous growth. Join the pack and become part of a collaborative, pragmatic, and thoughtful people-first community where we solve tough problems, take smart risks, and celebrate one another.