About the role
This is a full-time remote role for candidates in the APAC (Australia, Singapore and India) time zones.
What is Grafana Cloud?
Grafana Cloud is our composable observability platform that integrates metrics, logs, traces, and profiles with Grafana. It allows our customers to leverage the best open source observability software – including Prometheus, Mimir, Loki, Tempo, and Pyroscope – without the overhead of installing, maintaining and scaling their own observability stack.
The Databases team owns and operates our telemetry databases: Mimir for metrics, Loki for logs, Tempo for traces, and Pyroscope for profiles. Our databases are offered as a hosted service in Grafana Cloud, and additionally as on-premises solutions with Grafana Enterprise Metrics, Grafana Enterprise Logs, and Grafana Enterprise Traces. They are multi-tenant distributed systems implemented in Go and operating at scale on Kubernetes across all major cloud service providers (AWS, GCP, Azure).
As a company we are remote-first and global. We embrace people of different experiences and backgrounds to build diverse teams where every person brings a new perspective to the software.
Mimir Squad
The Mimir squad has two sub-squads, Ingest and Query, which together maintain the Mimir OSS project, and additionally own and operate Grafana Cloud Metrics across three major cloud providers. Engineers on the team focus on optimizing the efficiency and resilience of processing, storing, and querying metrics at large volumes. These services operate at a large scale and performance is key to keeping the offering competitive and running smoothly.
A Mimir engineer has various work streams. They are likely engaged in a larger project with another engineer while also mixing in performance and reliability improvements discovered through operating the system in production. They are also responsible for writing PRs and design documents, reviewing those of other engineers in the squad, shepherding automated release rollouts, and participating in the on-call rotation for their systems.
What will you be doing?
Take a very active role in influencing our roadmap and your own career objectives
Work with your team to deliver new features, then use the results to iterate and improve
Drive projects from initial idea all the way to operations once they are in the hands of customers
Embrace our open-source culture and contribute to other projects that may not directly fall within your team's scope
Design, build, operate, and maintain critical systems, owning the reliability, performance, and availability
Be a part of your team's follow-the-sun on-call rotations and take ownership of the services you're running
Support other team members, participate in design discussions and collaborate with the team
Learn new skills by gaining a deeper understanding of our cloud product and our customers and getting to know the codebase of a large distributed system
As we are remote-first and our engineering organization is entirely remote, we provide guidance and meet regularly using video calls, so an independent attitude and good communication skills are a must.
About the company
There are more than 20M users of Grafana, the open source visualization tool, around the globe, monitoring everything from beehives to climate change in the Alps. The instantly recognizable dashboards have been spotted everywhere from a NASA launch and Minecraft HQ to Wimbledon and the Tour de France. Grafana Labs also helps more than 3,000 companies -- including Bloomberg, JPMorgan Chase, and eBay -- manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self-managed with the Grafana Enterprise Stack, both featuring scalable metrics (Grafana Mimir), logs (Grafana Loki), and traces (Grafana Tempo).