The NexTier Technology team is looking for a Site Reliability Engineer (SRE) to help build, scale, and maintain highly reliable systems on Google Cloud Platform (GCP). This role blends software engineering with infrastructure expertise to ensure our services are performant, resilient, and cost-efficient.
Candidates will work closely with engineering teams to improve system reliability, automate operations, and embed best practices across the platform.
3+ years of experience in Site Reliability Engineering, DevOps, or similar roles
Strong experience with Google Cloud Platform (GCP) services (e.g., Compute Engine, GKE, Cloud Run, Cloud Storage)
Experience with containerization and orchestration (Docker, Kubernetes)
Proficiency in at least one programming language (e.g., Python, Go, Java)
Experience with Infrastructure as Code (Terraform preferred)
Solid understanding of networking, security, and distributed systems
Experience with monitoring and logging tools (e.g., Prometheus, Grafana, Cloud Monitoring)
Familiarity with CI/CD pipelines and automation tools
Hands-on experience with Google Cloud Platform, including:
GCP: GKE, Compute Engine, Cloud Storage, Pub/Sub (or equivalents)
Cloud Monitoring & Logging
BigQuery
Dataflow
Datastream
IAM and networking
Composer/AIrflow
Kubernetes: deployment, scaling, reliability patterns
CI/CD: GitHub Actions, GitLab CI, or similar
Observability: GCP Cloud Monitoring, Logging
Experience operating systems in 24/7 production environments
The Evolving Oil Field Demands Evolving Service Providers
NexTier is a leading provider of integrated completions that employs sustainable practices and equipment to support our customers’ ESG goals while accelerating production in the most demanding US land basins.