Crossover
Website:
crossover.com
Job details:
You're the engineer who maintains uptime for 50+ SaaS products when others are making educated guesses. We need DevOps engineers who can step into unfamiliar AWS environments, restore order from chaos, and drive availability above 99.9% using actual monitoring, automation, and root cause analysis. You'll break down complex projects into daily deliverables, deploy production-ready Python or JavaScript, and leverage AI as your assistant.
Most organizations talk about "the cloud" while manually tending individual servers. We're building industrial-grade reliability across dozens of acquired products where original teams have departed and documentation is incomplete. That's where it gets interesting: you'll apply agents and current tooling to understand new environments 5–10x faster, document your findings, and automate processes so repeat incidents become impossible. Rather than judging you on certifications and vendor logos, we'll observe you troubleshoot in real time, produce a genuine 5-Whys analysis that identifies one preventable root cause, and create automations that withstand production conditions.
This is not an L2 "follow the runbook" position. Here, you author the runbooks, architect the deployment path from development through staging to 10% then 100% with soak periods and rollback conditions, and implement the monitoring that detects corner cases. You reject risky changes before execution. You distinguish infrastructure failures you own from application bugs Engineering owns, and you route permanent remediation to the correct team.
You'll operate at the engineering center of reliability, managing infrastructure initiatives, incident response with RCAs, and change requests accompanied by copy-paste-ready runbooks. If you've already managed a significant SaaS product and want to expand that expertise across a portfolio, join us. Bring expert AWS knowledge, production-quality development skills, strict scope discipline, and daily, essential use of AI tooling. If you're prepared to maintain operational continuity, please apply.
What You Will Be Doing
- Advanced infrastructure migrations, consolidation projects, production-quality automations, and monitoring implementations
- Diagnosing production incidents, deploying immediate remediations, and authoring root cause analyses with permanent fixes routed to responsible teams
- Drafting, reviewing, and deploying production changes, including assessing whether a proposed change meets safety criteria for execution
What You Won’t Be Doing
- Spending time in Jira and perpetual status calls - we value engineers who deliver solutions, not just document issues
- Supporting legacy systems without end - you'll be authorized to implement substantial improvements
- Waiting on bureaucratic approval processes - you'll possess the authority to deploy immediate fixes during incidents
DevOps Architect Key Responsibilities
- Drive reliability and standardization of cloud infrastructure across our expanding product portfolio by deploying comprehensive monitoring, automation, and AWS best practices.
Basic Requirements
- Deep AWS infrastructure expertise (this is our primary platform - other cloud experience alone won't cut it)
- Experience managing production infrastructure at a scale of 1,000+ containers
- Experience scripting with Python and Bash for day-to-day administration operations
- Experience managing and migrating production databases with multiple engines (including MySql, Postgres, Oracle, MS-SQL)
- Experience with infrastructure automation (Terraform, Ansible, or CloudFormation)
About Trilogy
Hundreds of software businesses run on the Trilogy Business Platform. For three decades, Trilogy has been known for 3 things: Relentlessly seeking top talent, Innovating new technology, and incubating new businesses. Our technological innovation is spearheaded by a passion for simple customer-facing designs. Our incubation of new businesses ranges from entirely new moon-shot ideas to rearchitecting existing projects for today's modern cloud-based stack. Trilogy is a place where you can be surrounded with great people, be proud of doing great work, and grow your career by leaps and bounds.
There is so much to cover for this exciting role, and space here is limited. Hit the Apply button if you found this interesting and want to learn more. We look forward to meeting you!
Working with us
This is a full-time (40 hours per week), long-term position. The position is immediately available and requires entering into an independent contractor agreement with Crossover as a Contractor of Record. The compensation level for this role is $50 USD/hour, which equates to $100,000 USD/year assuming 40 hours per week and 50 weeks per year. The payment period is weekly. Consult www.crossover.com/help-and-faqs for more details on this topic.
Crossover Job Code: LJ-5236-IN-Mumbai-DevOpsArchitec.006
Click on Apply to know more.