About the role
As a Senior Software Engineer on our Infrastructure team, you'll play a key role in ensuring the reliability and resilience of FullStory's production systems and services. You'll work on building and maintaining the platforms, systems, and tooling that power FullStory's high-scale, high-availability web application. This includes building and maintaining core infrastructure, automating key processes, and continuing to evolve our architecture and DevOps practices. If you're passionate about building reliable, scalable, and maintainable systems, this role could be a great fit.
What you'll do:
- Design, build, and operate the infrastructure that powers FullStory's web application
- Collaborate with teams across the company to address reliability, scalability, and performance concerns
- Participate in and help evolve FullStory's incident response and SRE practices
- Drive continuous improvement in our cloud infrastructure, CI/CD pipelines, and observability tooling
- Mentor and coach more junior engineers on the Infrastructure team
We're looking for someone with:
- 5+ years of experience building and operating highly available, high-scale distributed systems in a cloud environment
- Strong programming and software engineering skills, particularly in one or more of the following languages: Go, Python, Rust, etc.
- Deep experience with modern infrastructure tooling like Kubernetes, Terraform, Ansible, and Prometheus
- Excellent problem-solving, communication, and collaboration skills
- Experience with incident response and SRE practices
- A passion for building reliable, scalable, and maintainable systems