AI First - Principal Engineer

DigitalT3

Full-time

Required skills

LangChain
Python
Agile
communication skills
containerization
Docker
ECS
end-to-end
FastAPI
frontend
full stack
GitHub
JS
Jira
Kubernetes
Lambda
microservices
Node
React
Terraform
TypeScript

About the role


Website: digitalt3.com
Job details:


Tamil Nadu (Remote) - Full-time (40 hrs/week)

Experience: 8–10 Years

Job Description

We're looking for a Principal Engineer to serve as a technical anchor for our AI-first engineering organization. In this role, you'll drive the architecture and technical direction of our most complex systems, define engineering standards across the team, and ensure our AI-powered products are built to scale. You'll work closely with the Tech Lead and senior stakeholders to translate business goals into robust, forward-thinking technical solutions — while also being deeply hands-on when it matters.

Key Responsibilities

Full Stack Development

Lead the technical design of large-scale, cross-cutting full-stack initiatives.
Define and enforce engineering standards, patterns, and best practices across teams.
Implement full-stack features using React.js, Next.js, and TypeScript on the frontend, with Python (FastAPI) and/or Node.js backends.
Architect and oversee microservices, APIs, and service integration strategies.
Own and evolve AWS-based infrastructure for web and backend workloads, using Terraform for infrastructure as code.
Lead containerization and orchestration strategy using Docker and Kubernetes.
Drive system reliability, security architecture, and cost optimization across services.
Size and estimate projects across full-stack and AI workstreams; guide planning and prioritization.
Operate within an Agile delivery model, using Jira for planning and cross-team coordination.
Mentor senior and mid-level engineers; champion high engineering standards through code and design reviews.

AI Engineering

Set the technical direction for GenAI development, including model selection, pipeline architecture, and scalability strategy.
Define and own best practices for LLM development, evaluation, and production deployment.
Architect end-to-end LLM pipelines from experimentation to production, including RAG systems and retrieval infrastructure.
Develop and standardize advanced prompt engineering frameworks across the team.
Lead rigorous model evaluations and define performance optimization strategies.
Design and oversee CI/CD pipelines tailored for AI systems.
Architect and maintain AWS-based infrastructure for AI workloads (SageMaker, etc.).
Build and evolve monitoring, observability, and evaluation tooling for LLM applications.

Candidate Requirements

8–10 years of software engineering experience, including meaningful time in senior or lead roles.
Deep full-stack development expertise using React.js, Next.js, and TypeScript.
Strong proficiency in Python (FastAPI) and/or Node.js for backend services.
Proven track record of building and deploying LLM or GenAI applications in production at scale.
Deep understanding of prompt engineering, LLM evaluation, and RAG systems.
Expert-level knowledge of AWS (SageMaker, Lambda, ECS, S3, etc.), Docker, and Kubernetes.
Strong experience with Terraform and CI/CD pipelines (GitHub Actions or similar).
Strong background in system design, API architecture, distributed systems, and microservices.
Experience driving engineering standards, cross-team alignment, and technical decision-making.
Strong communication skills — able to influence technical direction with peers and leadership.

Good to Have

Experience with fine-tuning, RLHF, or training LLMs.
Familiarity with LangChain, LlamaIndex, or similar frameworks.
Background in MLOps, model monitoring, and AI observability.
Experience with A/B testing and experimentation frameworks.
Open-source contributions or thought leadership in GenAI or distributed systems.

