MetLife
Website: metlife.com
Job details:
Description and Requirements
The purpose of the UST RRI SRE lead is to enhance customer satisfaction by continually increasing the stability, reliability, and resiliency of our applications. Our reliability engineers are driven by professional curiosity and a desire to develop a deep understanding of the services we support and the technologies they depend upon. Our SRE candidates should have a variety of skills and experience in software development and/or system engineering, and the ability to continually learn and pick up new skills.
Key Responsibilities:
- Use problem-solving skills, including identifying, researching, and coordinating the resources needed to troubleshoot effectively, and provide advanced diagnostic analysis and root cause identification for high-priority incidents.
- Proactively monitor and analyze application patterns and metrics to find opportunities to enhance reliability, prevent future incidents, and increase observability.
- Partner with internal and external technology associates to solve complex business needs.
- Work closely with the application development and architecture teams on collaborative technical decisions.
- Identify bottlenecks and problems throughout the infrastructure.
- Operate in a DevOps culture, including building relationships with other technical and business teams.
Essential Business Experience and Technology Skills:
Required:
- Basic knowledge of Site Reliability Engineering core principles.
- Basic experience with programming languages and frameworks, including Java, Node.js, Python, .NET, Groovy, Gremlin, TinkerPop, and Remix/React.
- Working understanding of and exposure to monitoring / diagnostic tools including AppDynamics, Elastic (APM, EUM, Synthetics, Logs, Infrastructure), Splunk, Zenoss Routing, Medallia/Decibel, Adobe AEM, VMware vROps, and Akamai.
- Basic experience with relational and non-relational data solutions such as MSSQL, IBM DB2, Oracle, MongoDB, HBase, and Hadoop (Cloudera).
- Basic knowledge of Multi-Tier / Multi-Tenant infrastructure design and solutions.
- Working understanding of application security and implementation of WAF, DDoS protection, TLS, Active Directory, SAML, OpenID, and OAuth.
- Working understanding of 3-Tier Network topologies, Hybrid Datacenter solutions, extranet connectivity, load balancers.
- Prior success extracting/translating findings into alternatives/solutions
- Exposure to cloud ecosystems such as ESXi, OpenShift, AWS, Docker, Azure, and GCP is a plus
- Persistent and driven
- Proven analytical and problem-solving skills
- Ability to work in a fast-paced, multi-tasking, dynamic environment
- Self-motivated and takes initiative
- Strong interpersonal skills and international cultural awareness
- Degree in Computer Science, Information Systems, or equivalent experience
- 2+ years of related IT experience
Preferred:
- Experience in production support, application development, infrastructure, or technology operations
- Critical application or system SME knowledge
- Cybersecurity and/or IBM mainframe experience
- Finance/Insurance industry experience
About MetLife
Recognized on Fortune magazine's list of the "World's Most Admired Companies" and Fortune World’s 25 Best Workplaces™, MetLife, through its subsidiaries and affiliates, is one of the world’s leading financial services companies, providing insurance, annuities, employee benefits, and asset management to individual and institutional customers. With operations in more than 40 markets, we hold leading positions in the United States, Latin America, Asia, Europe, and the Middle East.
Our purpose is simple: to help our colleagues, customers, communities, and the world at large create a more confident future. United by purpose and guided by our core values (Win Together, Do the Right Thing, Deliver Impact Over Activity, and Think Ahead), we’re inspired to transform the next century in financial services. At MetLife, it’s #AllTogetherPossible. Join us!