Billtrust
Website:
billtrust.com
Job details:
As a Database Reliability Engineer within our Operations Engineering Center, you'll ensure the reliability, performance, and scalability of Billtrust's mission-critical database infrastructure. You'll work with MySQL SQL Server, MongoDB, and modern cloud-native databases (Aurora, Cloud SQL), supporting solutions that support billions of transactions annually.
You'll drive automation through infrastructure-as-code, develop AI-powered database health monitoring systems, and implement intelligent backup and disaster recovery strategies. Your expertise will enable Billtrust GCC to deliver world-class database reliability while pioneering autonomous database management through agentic technologies.
Experience Level: 5-9 yrs
work Location: Hyderabad
Key Responsibilities
- Design, implement, and maintain high-availability and disaster recovery architectures for MySQL, SQL Server, and MongoDB databases
- Develop and execute database performance tuning strategies, query optimization, and workload analysis
- Implement infrastructure-as-code for database provisioning using Terraform and Ansible
- Engineer autonomous database health monitoring agents and agentic backup validation systems
- Establish backup, restore, and recovery automation using cloud-native tools
- Create and maintain operational runbooks for database failover, capacity planning, and incident response
- Implement database-as-code practices and schema migration automation
- Monitor and optimize database metrics using AI Ops and Datadog dashboards
- Collaborate with application teams on database design, indexing strategies, and performance requirements
- Work with the US database engineering team to support & implement roadmap
Required Qualification
Experience & Technical Background
- 5+ years of hands-on experience as a Database Administrator or Database Reliability Engineer
- Expert-level proficiency with MySQL, SQL Server, or MongoDB (at least one to expert level)
- Strong experience with cloud-native databases (Amazon Aurora, AWS RDS, MongoDB Atlas)
- Demonstrated expertise in backup, restore, disaster recovery, and high-availability architectures
- Strong background in infrastructure-as-code using Terraform, CloudFormation, or Ansible
- Experience with monitoring platforms (Datadog, Grafana, Prometheus) and observability practices
- Proficiency with Linux/Unix and shell scripting for automation
- Hands-on experience with cloud platforms (AWS, Azure, or GCP)
- Proficiency using Claude Code, Github Copilot or similar AI coding assistance
Soft Skills & Attributes
- Strong problem-solving skills with methodical approach to troubleshooting complex database issues
- Excellent communication skills to collaborate across teams and explain technical concepts
- Proactive mindset focused on automation and toil reduction
- Detail-oriented with strong focus on reliability and data integrity
- Commitment to continuous learning and staying current with database technologies
AI & Autonomous Agent Engineering Requirements
1 . Autonomous Database Health Agents
Engineer autonomous monitoring agents that continuously assess database health, predict capacity bottlenecks, and trigger proactive scaling actions before performance degrades.
2.AI-Powered Query Optimization
Develop intelligent agents that analyze query execution plans, identify optimization opportunities, and autonomously implement index recommendations and query rewrites.
3.Agentic Backup & DR Validation
Build agentic workflows that autonomously execute and validate backup/restore operations, disaster recovery runbooks, and failover procedures on scheduled and event-driven triggers.
4.Intelligent Anomaly Detection
Develop machine learning models and autonomous agents that detect anomalous database behavior — deadlocks, replication lag spikes, connection pool exhaustion — and initiate self-healing remediation.
5.Natural Language Database Ops
Build conversational interfaces that allow operations teams to query database status, execute routine maintenance, and troubleshoot issues using natural language commands.
If you 're excited to work on Database Reliability Engineers at Bill trust GCC are expected to actively develop and deploy AI powered automation that transforms database operations from reactive to autonomous.
Click on Apply to know more.