About the role
Who are we
At BCE Global Tech, immerse yourself in exciting projects that are shaping the future of both consumer and enterprise telecommunications. This involves building innovative mobile apps to enhance user experiences and enable seamless connectivity on-the-go. Thrive in diverse roles like Full Stack Developer, Backend Developer, UI/UX Designer, DevOps Engineer, Cloud Engineer, Data Science Engineer, and Scrum Master; at a workplace that encourages you to freely share your bold and different ideas. If you are passionate about technology and eager to make a difference, we want to hear from you! Apply now to join our dynamic team in Bengaluru.
We're seeking a dedicated Site Reliability Engineer to join our team. In this role, you will be responsible for maintaining the reliability, scalability, and performance of our systems. You'll implement best practices for monitoring, incident response, and automation to ensure seamless operations. Your expertise will help us build resilient infrastructure, reduce downtime, and enhance the overall user experience.
Requirements
Key Responsibilities
· Experience working with various monitoring tools. (eg. ELK, Dyntrace, Cloudwatch, Cloud logging, Cloud Monitoring, BMC Surveyor, BMC Patrol, Grafana, Prometheus)
· Ensure monitoring and self-healing strategies are implemented and maintained to proactively prevent production incidents.
· Perform root cause analysis of production issues
· Design and manage on call and escalation processes – Nice to Have
· Participate in design reviews and production reviews for new features, products, or pieces of infrastructure
· Designing and implementing ELK (Elasticsearch, Logstash and Kibana) stack, Prometheus and Grafana solutions for monitoring and alerting.
· Debug production issues across services and levels of the stack.
· Establish KPIs to demonstrate maturity, efficiency, and value to our business partners
· Works as an integral part of the DevOps team with complimentary skills and common goals
· L3 Support experience is an asset.
· Work to create a Release management process and help with Out-of-business-hour deployments and support (Rotation with team members)
· Familiar and comfortable with agile development techniques.
Technology Skills
· Monitoring and Logging:
· ELK (Elasticsearch, Logstash, Kibana)
· Dynatrace
· Cloudwatch
· Cloud Logging
· Cloud Monitoring
· BMC Surveyor
· BMC Patrol
· Grafana
· Prometheus.
Required qualifications to be successful in this role
· Bachelor's degree in computer science engineering, or related field.
· 8 -10 years of experience as a SRE.
· Proven experience as an SRE, DevOps engineer, or similar role.
· Strong programming skills in languages such as Python, Go, Java, or Ruby.
· Strong problem-solving skills and ability to work under pressure.
· Excellent communication and collaboration skills.
· Flexible to work in EST time zones ( 9-5 EST)
Additional Information:
Job Type: Permanent Full Time
Work Profile: Hybrid (Work from Office/ Remote)
Years of Experience: 8-12 Years
Location: Bangalore
About the company
At BCE Global Tech, immerse yourself in exciting projects that are shaping the future of both consumer and enterprise telecommunications. This involves building innovative mobile apps to enhance user experiences and enable seamless connectivity on-the-go. Thrive in diverse roles like Full Stack Developer, Backend Developer, UI/UX Designer, DevOps Engineer, Cloud Engineer, Data Science Engineer, and Scrum Master; at a workplace that encourages you to freely share your bold and different ideas. If you are passionate about technology and eager to make a difference, we want to hear from you! Apply now to join our dynamic team in Bengaluru.