Crossover
Website:
crossover.com
Job details:
You're the engineer who keeps more than 50 SaaS products running when others are second-guessing. We need DevOps engineers who can step into unknown AWS environments, restore order, and drive uptime beyond 99.9% through effective monitoring, automation, and rigorous root-cause analysis. You'll break down complex projects into single-day tasks, deliver production-ready Python or JavaScript, and leverage AI as your assistant.
Most organizations talk about "cloud" while manually maintaining individual servers. We're scaling reliability across dozens of acquired products where the original builders have departed and documentation is incomplete. That's the challenge: you'll use agents and current tooling to explore unfamiliar systems 5–10x faster, document your findings, and automate them so the same outage never recurs. Rather than judging you on certifications and vendor badges, we'll observe you troubleshoot in real time, produce a genuine 5-Whys that identifies one preventable root cause, and create automations that withstand production conditions.
This is not an L2 "execute the runbook" position. In this role, you author the runbooks, architect the deployment from dev to staged to 10% to 100% with soak periods and rollback triggers, and construct the monitors that detect edge cases. You reject risky changes before they're deployed. You distinguish infrastructure failures you own from application bugs Engineering owns, and you route permanent fixes to the correct team.
You'll operate at the engineering center of reliability, owning infrastructure projects, incident response and RCAs, and change requests with copy-paste-executable runbooks. If you've already owned a significant SaaS product and want to extend that discipline to a fleet, join us. Bring expert-level AWS, production-grade coding skills, rigorous scope control, and daily, critical use of AI tools. If you're prepared to keep the lights on, please apply.
What You Will Be Doing
- Complex infrastructure migrations, consolidations, production-grade automations, monitoring changes
- Triaging production outages, deploying immediate fixes, and producing root cause analyses with permanent fixes assigned to the responsible teams
- Creating, reviewing, and executing production changes, including validating whether a proposed change is safe to execute
What You Won’t Be Doing
- Living in Jira and endless status meetings - we value people who can drive solutions, not just track problems
- Maintaining outdated systems indefinitely - you'll be empowered to drive meaningful improvements
- Getting blocked by bureaucratic approval chains - you'll have the authority to execute immediate fixes to resolve incidents
Cloud Architect Key Responsibilities
- Drive reliability and standardization of cloud infrastructure across our growing product portfolio by implementing robust monitoring, automation, and AWS best practices.
Basic Requirements
- Deep AWS infrastructure expertise (this is our primary platform - other cloud experience alone won't cut it)
- Experience owning large production infrastructure and troubleshooting production outages independently (not just following a runbook)
- Experience scripting with Python and Bash for day-to-day administration operations
- Experience managing and migrating production databases with multiple engines (including MySql, Postgres, Oracle, MS-SQL)
- Experience with infrastructure automation (Terraform, Ansible, or CloudFormation)
- Linux systems administration expertise
About Trilogy
Hundreds of software businesses run on the Trilogy Business Platform. For three decades, Trilogy has been known for 3 things: Relentlessly seeking top talent, Innovating new technology, and incubating new businesses. Our technological innovation is spearheaded by a passion for simple customer-facing designs. Our incubation of new businesses ranges from entirely new moon-shot ideas to rearchitecting existing projects for today's modern cloud-based stack. Trilogy is a place where you can be surrounded with great people, be proud of doing great work, and grow your career by leaps and bounds.
There is so much to cover for this exciting role, and space here is limited. Hit the Apply button if you found this interesting and want to learn more. We look forward to meeting you!
Working with us
This is a full-time (40 hours per week), long-term position. The position is immediately available and requires entering into an independent contractor agreement with Crossover as a Contractor of Record. The compensation level for this role is $50 USD/hour, which equates to $100,000 USD/year assuming 40 hours per week and 50 weeks per year. The payment period is weekly. Consult www.crossover.com/help-and-faqs for more details on this topic.
Crossover Job Code: LJ-5236-IN-NewDelhi-CloudArchitect.002
Click on Apply to know more.