Fission Labs
Website:
fissionlabs.com
Job details:
Role: Sr AWS DevOps Engineer - 66304
Experience: 5-9 years
No. of positions: 2
Location: Hyderabad (Hybrid)
Responsibilities
- Advanced Linux & OS Troubleshooting: Perform in-depth analysis of system issues including disk I/O bottlenecks, kernel panics, service failures, and networking issues using tools like strace, journalctl, netstat, iostat, lsof, etc.
- DR Solution Design & Build: Architect, provision, and configure a comprehensive Disaster Recovery solution on AWS — including compute, networking, storage, and failover components aligned to defined RPO/RTO objectives
- Infrastructure Enhancements: Identify and implement additional infrastructure improvements to strengthen the reliability, scalability, and security posture of the DR environment, including advanced networking using Transit Gateway, VPC Peering, and traffic segmentation.
- CI/CD Pipeline Development: Design and maintain Jenkins-based CI/CD pipelines to automate infrastructure provisioning, application deployments, and environment configuration across dev, staging, and production environments using CloudFormation.
- Risk Remediation & Security Hardening: Assess and remediate identified risks across IAM roles and policies, Secrets Manager configurations, encryption standards, WAF rules, and third-party dependencies to meet security and compliance requirements.
- Disaster Recovery Testing & Validation: Execute and support DR testing activities including RPO/RTO validation, cross-region replication verification, Multi-AZ failover simulations, and Route 53 DNS failover testing.
- Technical & DR Documentation: Author and maintain technical documentation including DR Operational Readiness Documents (ORDs), runbooks, and architecture diagrams using provided templates and guidance.
- Monitoring & Observability: Configure and maintain AWS CloudWatch dashboards, alarms, and log groups to provide full visibility into DR infrastructure health, incidents, and automated recovery actions.
- Deployment Support & Warranty: Provide hands-on deployment support during go-live and deliver post-deployment warranty coverage to ensure environment stability and rapid resolution of any issues.
- GitLab to GitHub Migration: Support the potential migration of source code repositories from GitLab to GitHub, including pipeline reconfiguration, branch strategy alignment, and post-migration validation, as bandwidth permits.
- Scripting & Automation: Develop and maintain Shell and Python scripts to automate routine operational tasks, DR workflows, risk-remediation actions, and event-driven processes — with observability and guardrails built in.
Skill Set Required For DevOps Engineers
- Cloud - AWS Core Services:
- EC2, S3, VPC
- IAM
- CloudFront
- API Gateway
- Lambda
- Secrets Manager
- WAF
- Route 53
- CloudWatch
- ELB
- Cloud - Advanced Networking and Connectivity:
- AWS Transit Gateway (TGW)
- VPC Peering (cross‑account, cross‑region)
- Route table design & traffic segmentation
- NACL vs Security Group design and enforcement
- Private connectivity patterns (AppGate / ZTNA concepts)
- Infrastructure as Code: CloudFormation
- Standard CI/CD: Jenkins
- Monitoring: AWS CloudWatch
- Disaster Recovery:
- RPO/RTO planning
- Cross-region replication
- AWS Backup
- Multi-AZ failover
- Route 53 DNS failover, etc.
- Scripting : Shell, Python
- Tools : JIRA, GitHub/GitLab
AI skills (nice to have): Event‑driven and agentic automation, Python‑based orchestration, secure AI/API integration, decision guardrails, and full observability for automated DR and risk‑remediation actions.
Click on Apply to know more.