Flag job

Report

Senior Site Reliability Engineer, IaaS

Location

Paris, France

JobType

full-time

About the job

Info This job is sourced from a job board

About the role

Algolia is set to enable every company to create world-class Search and Discovery experiences with an API-first approach. Performance and Scalability is at the heart of our mission: we power 1.5 trillion searches a year, for 10K+ customers all over the world. If you're a problem solver, able to think outside the box and eager to nurture others and learn from them, then this is your challenge! The Team The Infrastructure as a Service (IaaS) team aims at upholding the reliability and scalability we expect from Algolia's infrastructure for its critical systems and products. Our focus is on enabling teams across Algolia to leverage this infrastructure while keeping it under control through an always increasing level of automation. The Opportunity The Senior Site Reliability Engineer position within the IaaS team provides a dynamic opportunity for a professional with foundational experience in maintaining and optimising scalable infrastructures. This role specifically concentrates on three key areas: Server and container hosting, cloud and network expertise and flawless observability. As a Senior Site Reliability Engineer (SRE), you will play a pivotal role in designing, implementing, and maintaining highly available, scalable, and fault-tolerant systems. Your work will directly impact the effectiveness and productivity of teams and clients, as you enhance infrastructure reliability and streamline operations. You will collaborate closely with cross-functional teams, mentor peers, and lead technical projects, ensuring that Algolia's infrastructure scales seamlessly and operates reliably. Your Role Will Consist Of: Kubernetes and Cloud Services Leadership - Oversee and enhance Kubernetes-based architectures and cloud services, ensuring fault tolerance, optimal resource utilization, and seamless scalability. Advanced System Management and Configuration - Lead the improvement of infrastructure code and automation, managing a fleet of thousands of servers to maintain safety, efficiency, and reliability. Control Plane Advancement - Architect and extend the control plane into a comprehensive platform that empowers teams to develop performant and scalable products, with reliability baked into the foundation. Cross-Team Collaboration and Leadership - Collaborate with and mentor team members while solving complex technical challenges, fostering a culture of ownership and shared accountability across teams. Engineering Process Leadership - Define and implement engineering processes and best practices to ensure the delivery of high-quality, reliable, and scalable systems. Proactive Problem-Solving and Innovation - Identify and resolve systemic issues, implement innovative solutions, and drive continuous improvement in infrastructure and operations.

About the company

Algolia prides itself on being a pioneer and market leader offering an AI Search solution that empowers 17,000+ businesses to compose customer experiences at internet scale that predict what their users want with blazing fast search and web browse experience. Algolia powers more than 30 billion search requests a week – four times more than Microsoft Bing, Yahoo, Baidu, Yandex and DuckDuckGo combined. Algolia is part of a cadre of innovative new companies that are driving the next generation of software development, creating APIs that make developers' lives easier; solutions that are better than building from scratch and better than having to tweak monolithic SaaS solutions. In 2021, the company closed $150 million in series D funding and quadrupled its post-money valuation of $2.25 billion. Being well capitalized enables Algolia to continue to invest in its market leading platform, to better serve its thousands of customers–including Under Armor, Petsmart, Stripe, Gymshark, and Walgreens, to name just a few.

Skills

kubernetes
linux
python
ruby
golang
cloud
distributed-systems