Senior Platform Engineer
Location: Remote-first | Department: Engineering - Platform
Reports to: Engineering Manager, Platform
About Shipsy
Shipsy is a leading SaaS platform powering logistics transformation for global enterprises. Our smart transport management platform integrates deeply with supply chain systems to streamline operations, enhance visibility, and deliver real-time insights. We’re growing fast, and our infrastructure needs to scale just as fast.
About the Role
We’re looking for a Senior Platform Engineer to help scale and harden the platform that powers our mission-critical logistics solutions. You’ll be at the core of our infrastructure evolution, driving reliability, security, and developer experience across our environments.
This is a high-impact, hands-on engineering role with ownership over key initiatives including infrastructure automation, observability, platform security, and multi-tenant compute setup.
What You’ll Do
• Infrastructure as Code: Manage and evolve our AWS-based infrastructure (ECS, RDS, Redshift, S3/MinIO, Elasticsearch) using Terraform.
• Platform Reliability: Improve and maintain a reliable, secure platform running hundreds of services. You’ll own key parts of our ECS orchestration, network policies, and IAM boundaries.
• Observability: Drive observability standards across services using New Relic, consolidated test reports, and enhanced logs/traces.
• CI/CD & Developer Experience: Optimize pipelines and introduce best practices for retries, parallelism, and test reliability. Own platform tooling that helps developers ship confidently.
• Multi-Tenancy & Sandboxes: Lead the design and implementation of dedicated sandbox environments to support isolated tenant workloads and testing.
• Security & Access Controls: Reinforce least-privilege principles (e.g., no-root access), manage privilege escalation workflows via AWS Connect, and monitor compliance.
• Collaboration: Work closely with Backend, Security, and QA teams to ensure the platform supports their goals with speed and safety.
• Mentorship: Guide mid-level engineers through platform patterns, tooling, and infra reliability techniques.
What You Bring
• 5+ years of experience in infrastructure, platform, or backend engineering.
• Deep understanding of AWS services: ECS (Fargate), RDS (PostgreSQL), S3, IAM, CloudWatch.
• Experience with Terraform or similar IaC tooling.
• Strong experience with observability: logs, metrics, traces, alerting (e.g., New Relic, ELK).
• Solid experience with Docker and container orchestration principles.
• Proficiency in scripting languages like Python, Bash, or Go.
• Comfort working in secure, production-grade environments with compliance boundaries (e.g., no-root, restricted shells).
• Experience owning uptime, performance, and deployment reliability across services.
Nice to Have
• Experience with MinIO, Redshift tuning, and Elasticsearch scaling.
• Knowledge of CI/CD systems like GitHub Actions or Jenkins.
• Exposure to secure transport systems or logistics-heavy platforms.
• Familiarity with flaky test analysis, sandbox isolation strategies, and test reliability tooling.
What Success Looks Like
• Platform SLAs improve across uptime, latency, and incident resolution.
• Infrastructure is fully codified, version-controlled, and reproducible.
• Teams experience faster feedback cycles and more predictable deploys.
• Sandbox environments are scalable, secure, and isolated by tenant/workload.
• Observability and debug tooling are accessible and insightful across the stack.