Report

Site Reliability Engineer

Location

Bengaluru, Karnataka, India

JobType

full-time

About the job

Info This job is sourced from a job board

Overview

About the role

Mumba Technologies, Inc.

Website: mumbatech.com
Job details:

Job Description

We are looking for a Staff Site Reliability Engineer to help us grow our domain expertise and provide support in a new global region to enable 24x7 development velocity as a global company. From AWS cloud provisioning as code to improving the developer experience in your working timezone, to acting as a guide to best practices around building and delivering software globally, we need an SRE with the passion, motivation, and great ideas to make everything better.

What you’ll do

· Automate the provisioning of all of infrastructure in code. Everything we do is in code!

· Partner with our Platform Engineering team on building developer tooling / improving developer experiences via joint initiatives and enhancements.

· Partner with our Data Engineering team on improving our data posture and driving operational excellence.

· Evolve our deployment pipelines to automate infrastructure deployments with the latest and greatest (and reliable) technologies.

· Improve metrics on our main services, and act as a subject matter expert for our global dev teams.

· Enable observability, SLO/SLI reporting, and respond to business impacting incidents as it pertains to infrastructure.

· Adopt and drive solutions that align with AWS Well Architected frameworks and business objectives.

· Identify performance bottlenecks and provide recommendations for improvement.

· Proactively identify and solve problems that we didn’t even know we had.

· Help build, deploy, and scale a load testing environment that is analogous to production.

· Enforce security and operational safety controls.

· Participate in technical roadmap planning and estimation.

· Participate and contribute in production readiness and architecture review board (ARB) meetings and forums.

· Train and mentor future engineers in the same region.

· Contribute to the architectural improvements to meet future scaling and observability requirements

Qualifications

· A profound love for solving hard problems and overcoming challenging obstacles.

· Putting your customers first, whether they be internal or external, and making them more productive, happy, and successful.

· Experience with AWS. Other public cloud providers are a bonus.

· Experience with PostgreSQL is a must. Additional experience with document databases is a nice-to-have.

· Experience with cloud security best practices (CSPM, CDR, CWPP, SIEM, etc) to keep our customers and cloud posture secure.

· Experience with containers (builds, registries, vulnerabilities scanning, run-time with docker-compose, run-time with TILT, run-time in schedulers/orchestration systems).

· Multi-year hands-on experience and fluency with Kubernetes and helm charts are an absolute skill requirement. We live and breathe the k8s ecosystem.

· Experience with a CI/CD pipeline. We use a combination of Github Actions, ArgoCD, Helm and GitOps in our deployment process, but again, any are fine.

· Some sort of infrastructure-as-code system: Ansible, Terraform, CloudFormation, CDK, etc.

· We use Python and Typescript, so knowledge and exposure with either is a strong plus.

· Experience breaking up monolithic architectures into microservices

· Experience with service meshes and service discovery solutions.

· Experience with an observability solution: New Relic, Prometheus, DataDog, etc.

· Experience with logging systems: CloudWatch, ELK, Splunk, etc.

· Bachelor’s degree in Computer Science or similar or equivalent experience

Click on Apply to know more.

Skills

Python

SIEM

AWS

Ansible

business objectives

CloudFormation

CloudWatch

Datadog

docker-compose

GitHub

Helm

infrastructure-as-code

K8s

Kubernetes

microservices

PostgreSQL

Splunk

SRE

Terraform

TypeScript