Sr. Advanced Software Engineer

Honeywell

Location: Bengaluru, Karnataka, India
Job type: Full-time

Required skills

Python
AWS
Angular
Ansible
API
Atlassian
automated testing
Azure
Bash
Bazel
caching
capacity planning
CloudFormation
compliance
configuration management
containerization
database
DevOps
DNS
Docker
Flux
frontend
GCP
GitHub
Helm
hybrid cloud
Java
JS
Jenkins
Kubernetes
load balancing
microservices
Node
React
SRE
TCP
Terraform
Vault
version control
Vue
PowerShell

About the role

Honeywell

Website: honeywell.com
Job details:
Job Description

Senior Advanced Software Engineer

Site Reliability Engineering

As a Site Reliability Engineer here at Honeywell Aerospace Technologies, you will play a crucial role as a subject matter expert in ensuring the reliability, availability, and performance of our systems and services. You will work closely with development and operations teams to implement best practices in reliability engineering, automation, and monitoring, driving improvements across our infrastructure.

You will report directly to our SRE Engineering Manager and you’ll work out of our Krakow, Poland location on a Hybrid work schedule as part of a global 24x7x365 team.

In this role, you will impact the efficiency and effectiveness of our operations by enhancing system reliability and performance, ultimately contributing to customer satisfaction and business success.

At Honeywell Aerospace Technologies, our people leaders play a critical role in developing and supporting our employees to help them perform at their best and drive change across the company. Help build a strong, diverse team by recruiting talent, identifying, and developing successors, driving retention and engagement, and fostering an inclusive culture.

Benefits Of Working For Honeywell Aerospace Technologies

In addition to a competitive salary, leading-edge work, and developing solutions side-by-side with dedicated experts in their fields, Honeywell employees are eligible for a comprehensive benefits package. This package includes employer-subsidized Medical, Dental, Vision, and Life Insurance; Short-Term and Long-Term Disability; 401(k) match, Flexible Spending Accounts, Health Savings Accounts, EAP, and Educational Assistance; Parental Leave, Paid Time Off (for vacation, personal business, sick time, and parental leave), and 12 Paid Holidays. For more information visit: click here

The application period for the job is estimated to be 40 days from the job posting date; however, this may be shortened or extended depending on business needs and the availability of qualified candidates.

YOU MUST HAVE

Bachelor’s degree from an accredited institution in a technical discipline such as science, technology, engineering, mathematics.
Minimum of 4 years of experience in site reliability engineering or related fields.
Strong knowledge of cloud services, containerization, and orchestration technologies.
Proficiency in scripting languages such as Python, Bash, or similar.
Experience with monitoring tools and practices, including Prometheus, Grafana, or similar.

WE VALUE

Advanced degree in Computer Science, Engineering, or related field.
Experience with DevOps practices and tools.
Strong analytical and problem-solving skills.
Ability to work collaboratively in a team environment.
Passion for continuous improvement and learning.

About Honeywell

Honeywell International Inc. (Nasdaq: HON) invents and commercializes technologies that address some of the world’s most critical challenges around energy, safety, security, air travel, productivity, and global urbanization. We are a leading software-industrial company committed to introducing state-of-the-art technology solutions to improve efficiency, productivity, sustainability, and safety in high-growth businesses in broad-based, attractive industrial end markets. Our products and solutions enable a safer, more comfortable, and more productive world, enhancing the quality of life of people around the globe. Learn more about Honeywell: click here

About The Role

We are seeking a Site Reliability Engineer (SRE) with strong Database Administration (DBA) skills to ensure the reliability, performance, and scalability of our infrastructure and data platforms. You will work across engineering, operations, and data teams to build resilient systems, automate operations, and maintain mission‑critical databases. You'll create standardized CI/CD frameworks that empower development teams while providing hands-on support to troubleshoot and resolve their build and deployment issues.

This role is ideal for someone who enjoys solving distributed‑systems challenges while also diving deep into database internals, performance tuning, and data reliability.

Good to Have

Experience with additional programming languages (Java, Node.js, Go)
Knowledge of frontend frameworks (React, Angular, Vue.js)
GitOps implementation experience (ArgoCD, Flux)
Service mesh technologies (Istio, Linkerd)
Advanced deployment strategies (blue-green, canary, feature flags)
Database CI/CD and migration automation (Entity Framework, Flyway, Liquibase)
Security scanning tools integration (SonarQube, OWASP, Snyk)
Monitoring and observability tools (Prometheus, Grafana, ELK, Application Insights)
Configuration management (Ansible, Chef, Puppet)
Multi-cloud or hybrid cloud deployment experience
Experience building internal developer platforms
Creating CLI tools or IDE extensions for developer productivity
Policy-as-code implementation (OPA, Sentinel)
Cloud certifications (Azure, AWS, or GCP)
Kubernetes certifications (CKA, CKAD)
Experience with monorepo tools (Nx, Turborepo, Bazel)
API gateway and microservices architecture experience

Tools & Technologies

Cloud: Azure
Containers & K8s: Docker, AKS/EKS, Helm, Istio
Observability: OpenTelemetry, Prometheus/Grafana, Azure Monitor/Log Analytics, Dynatrace, Elastic
CI/CD: GitHub Actions or Azure DevOps Pipelines; canary/blue-green deployments
IaC & Config: Terraform/Terragrunt, Bicep, Vault/Azure Key Vault, SSM
Security: Dependabot; Cosign; OPA

Responsibilities

Key Responsibilities

Reliability Engineering

Define and manage service SLOs/SLIs, track error budgets, and drive reliability roadmaps.
Proactively identify reliability bottlenecks, lead remediation, and preventative actions.
Establish CI/CD best practices and standards across the organization

Observability & Telemetry

Implement and scale metrics, logs, and traces across services (e.g., Prometheus/Grafana, OpenTelemetry, Dynatrace/Azure Monitor, ELK).
Build actionable dashboards and alerts with noise reduction and runbooks for on-call.

Incident Management

Own on-call rotations, triage, and coordination; drive post-incident reviews and blameless RCA with clear corrective actions.
Automate rollback/roll-forward, health checks, and verification steps.

Performance & Capacity

Conduct load and resilience testing; manage capacity planning and cost optimization (autoscaling, right-sizing, caching).
Tune databases, queues, and network settings for throughput and latency.

Automation & Tooling

Reduce toil with automation and self-service tooling; standardize deployment and recovery procedures.
Build reliability guardrails (chaos experiments, circuit breakers, rate limiting, backoff).

Platform & Infrastructure

Operate and harden Kubernetes clusters, container runtimes, and service meshes.
Manage infrastructure using Infrastructure as Code (IaC) - (Terraform/CloudFormation/Bicep), secrets management, and policy-as-code.

Security & Compliance

Implement DevSecOps practices: vulnerability management, dependency scanning, Identity and Access Management (IAM) hardening.

Collaboration

Partner with developers, QA, and product on design reviews, release strategies, and production readiness.
Document standards and provide enablement sessions to elevate reliability practices.
Create comprehensive documentation and self-service guides.

Qualifications

Required Qualifications

Experience: 3–8+ years in SRE/Platform/DevOps/Operations roles with ownership of production systems at scale.
Cloud: Hands-on with AWS/Azure/GCP (preferably two); strong grasp of managed services trade-offs.
Containers & Orchestration: Docker and Kubernetes (AKS/EKS/GKE); Helm/Kustomize; service mesh familiarity (Istio).
Observability: OpenTelemetry; metrics/logs/traces design; alerting strategies; RCA & postmortems.
Infrastructure as Code: Terraform (preferred) or Cloud-native equivalents; modules, remote state, and CI integration.
Programming & Scripting: Proficiency in Python/Go and Bash for automation, tooling, and APIs.
Reliability Practices: SLO/error budgets, capacity planning, chaos/resilience testing, progressive delivery.
Soft Skills: Calm under pressure, strong communication, pragmatic decision-making, and a continuous improvement mindset
Understanding of networking fundamentals (DNS, load balancing, TCP/IP)
Expertise in designing and developing reusable CI/CD pipeline templates
Proficiency with at least two CI/CD platforms (Atlassian, Azure DevOps, Jenkins, GitHub Actions, GitLab CI)
Strong experience with Docker and Kubernetes
Infrastructure as Code skills (Terraform, ARM templates, or CloudFormation)
Cloud platform expertise (Azure, AWS, or GCP)
Experience troubleshooting build and deployment issues across multiple technology stacks
Strong Git and version control workflow knowledge
Experience with automated testing frameworks (.NET: xUnit/NUnit, Python: pytest)
Artifact and package management (NuGet, PyPI, Azure Artifacts, Artifactory)
Scripting skills (PowerShell, Bash, Python)

Click on Apply to know more.

This page is fully interactive when JavaScript is enabled. Please enable JavaScript to apply or browse related roles.