Flag job

Report

Staff Software Engineer

Min Experience

8 years

Location

San Francisco Bay Area

JobType

full-time

About the job

Info This job is sourced from a job board

About the role

As a Staff Software Engineer at JazzX, you'll define the technical vision for our enterprise AI platform, mentor senior engineers, and oversee the design, development, and deployment of critical AI, software and infra services and components. You'll be responsible for ensuring the platform's scalability, reliability, security and observability. What You Will Do Design and Architect Scalable Solutions: Lead the design and development of high-performance, scalable services and components for our enterprise AI platform using a cloud-first, cloud-agnostic approach with world class AI capabilities, scalability, security, privacy and operability. Champion a cloud-first, cloud-agnostic approach to ensure platform versatility, scalability, and future-proof architecture. Champion innovative AI and software design patterns and best practices to ensure code maintainability, reusability, and testability. Contribute to the platform's technical direction by staying up to date on emerging AI technologies, cloud design patterns and user experience methodologies. Collaborative Innovation: Foster a collaborative environment by working closely with cross-functional teams (AI/research engineers, software engineers, architects, product managers, and designers) to translate complex platform and product requirements into elegant, scalable, and modular AI, software and infrastructure components. Foster a collaborative and inclusive environment by effectively communicating technical concepts and design decisions across diverse audiences. Provide mentorship and guidance to junior engineers, fostering their growth and development. Driving Operational Excellence: Oversee the seamless integration of components into the platform, ensuring efficient operation, reliability, and a dependable user experience for both customers and partners. Champion DevOps and AIOps best practices for AI services, maintainability, performance optimization, and observability. Requirements Minimum 8+ years of experience designing, building, and operating cloud-native applications at scale. Expertise in container orchestration frameworks like Kubernetes and containerization technologies like Docker. Expertise in AI/ML/data capabilities, platforms, frameworks and tools. Proven track record deploying and managing cloud environments using Infrastructure as Code (IaC) tools like Terraform, Ansible, or CloudFormation. Experience in designing and implementing highly available, scalable, and fault-tolerant systems. Development Lifecycle: End-to-end ownership of the entire software development lifecycle (SDLC) for complex enterprise platforms and products. This includes experience in design, prototyping, development (including unit, integration, and end-to-end testing), deployment, and ongoing operations. Technical Skills: Mastery of essential software development principles with a focus on clean code, design patterns, and best practices. In-depth expertise in at least two of the following programming languages: Java, JavaScript (Node.js preferred), Python, Golang, or C++. Solid understanding of distributed systems concepts and experience with distributed tracing tools. AI/ML Experience: Demonstrated experience in building, deploying, and managing Machine Learning (ML) and Artificial Intelligence (AI) platforms and services. Familiarity with MLOps practices, frameworks (TensorFlow, PyTorch), and tools is a plus. Familiarity with modern LLMs and AI agent frameworks like LangChain, LlamaIndex, AutoGen etc. is a strong plus.

About the company

SAI Group is a private investment firm that has committed $1 billion to incubate and scale revolutionary AI-powered enterprise software application companies. Our portfolio, a testament to our success, comprises rapidly growing AI companies that collectively cater to over 2,000+ major global customers, approaching $600 million in annual revenue, and employing a global workforce of over 4,000 individuals.

Skills

cloud-native applications
Kubernetes
Docker
AI
ML
Terraform
Ansible
CloudFormation
Java
JavaScript
Python
Golang
C++