AI Solutions Architect

Weekday (YC W21)

Location: India
Job type: Full-time

Required skills

LangChain
AWS
Azure
compliance
containerization
database
Docker
end-to-end
Google Cloud
GPU
Kubernetes
microservices
RESTful

About the role

Website: weekday.works
Job details:
This role is for one of Weekday's clients.

Min Experience: 4 years

Location: Remote (India)

This role requires strong collaboration with engineering, product, and business teams to design robust AI architectures that align with organizational goals while ensuring scalability, performance, and responsible AI practices.

Requirements

Required Qualifications

More than 4 years of experience in roles related to software engineering or architecture, with significant involvement in AI/ML systems
Extensive knowledge of contemporary neural network architectures, such as Transformers, CNNs, and RNNs
Demonstrated ability in developing scalable and distributed architectures for applications driven by AI
Hands-on experience with cloud platforms such as AWS, Azure, or Google Cloud
Proficient in containerization and orchestration technologies, especially with Docker and Kubernetes
Strong understanding of microservices architecture, RESTful APIs, and the design of distributed systems
Familiarity with MLOps / LLMOps pipelines, covering aspects like model training, deployment, monitoring, and lifecycle management
Reasonable understanding of large-scale data systems and modern database technologies
Adept at transforming business requirements into scalable AI solution architectures
Excellent documentation skills for architectural designs, workflows, and technical decision-making processes
Ability to thrive in a startup or fast-paced environment, demonstrating a strong sense of ownership and leadership

Key Responsibilities

Design and oversee the development of scalable generative AI systems and enterprise-grade AI platforms. Establish robust architectures that support model training, inference, monitoring, and lifecycle management in production environments. Direct the selection, customization, and enhancement of state-of-the-art generative AI and large language models.

Develop and implement APIs, microservices, and integration frameworks to incorporate AI capabilities into enterprise applications. Ensure that AI platforms meet stringent standards for performance, reliability, security, and scalability, while also adhering to data governance and privacy regulations.

Collaborate closely with product, engineering, and business teams to outline technical requirements and approaches to AI architecture. Architect end-to-end pipelines for deploying and monitoring AI models, ensuring seamless integration with existing systems.

Guide architectural decisions for LLM applications, AI workflows, and distributed AI infrastructure. Institute best practices for ethical AI development, including strategies to mitigate risks like model hallucinations, bias, and reliability challenges.

Provide technical mentorship and guidance to engineering teams, while contributing to the formulation of long-term technology strategies and the advancement of AI platforms.

Preferred Qualifications

Experience with Generative AI frameworks and orchestration tools such as LangChain, LangGraph, or similar platforms
Expertise in prompt engineering, LLM fine-tuning techniques (LoRA, RLHF, PEFT), and methods for model optimization
Familiarity with performance optimization techniques for AI workloads, including GPU/TPU acceleration, quantization, pruning, and model distillation
Experience with AI observability and monitoring solutions for evaluating model performance, drift, and anomalies
Understanding of AI governance, security, and compliance frameworks such as GDPR or SOC 2
Prior experience in developing enterprise-scale AI or LLM-based products

Skills

MLOps / LLMOps pipelines

AWS, Azure, or Google Cloud

RESTful APIs

Docker and Kubernetes