Senior/Lead Generative AI Engineer

DATAECONOMY

Location: Hyderabad, Telangana, India
Job type: Full-time

Required skills

LangChain
Python
AWS
backend
cloud infrastructure
CloudWatch
Docker
DynamoDB
EC2
ECS
FastAPI
kernel
Kubernetes
Lambda
machine learning
microservices
NumPy
Pandas
PF
Ray
TensorFlow
Pytorch
REST APIs

About the role

Website: dataeconomy.ai
Job details:
Job Location- Hyderabad

Experience- 5+years

We are looking for a Senior Generative AI Lead with strong expertise in Generative AI, Python backend development, and AWS cloud to design and build production-grade AI applications. The ideal candidate should have hands-on experience building LLM-powered systems, scalable backend services, and cloud-native AI solutions.

Requirements

Responsibilities

Design and build production-grade Generative AI applications such as AI assistants, enterprise chatbots, document intelligence systems, knowledge copilots, and AI-powered automation platforms.
Develop and deploy LLM-powered applications using orchestration frameworks such as LangChain, LangGraph, LlamaIndex, Strands, AutoGen, CrewAI, Haystack, DSPy, and Semantic Kernel.
Build advanced Retrieval Augmented Generation (RAG) systems including Graph RAG, Hybrid RAG, multi-hop retrieval, Agentic RAG, and knowledge graph–based retrieval pipelines.
Develop AI agents and multi-agent systems capable of reasoning, tool usage, and task orchestration using frameworks such as LangGraph, AutoGen, CrewAI, and Strands.
Build Python backend services and scalable APIs using FastAPI and modern backend frameworks while following microservices architecture principles.
Design scalable backend architectures for AI applications with asynchronous processing, task queues, and distributed workloads.
Integrate foundation models and LLM providers such as OpenAI, Anthropic Claude, LLaMA and open-source LLMs, Google Gemini, and Hugging Face models.
Implement document ingestion pipelines including chunking strategies, embedding generation, metadata enrichment, indexing, and semantic retrieval.
Build semantic search and vector retrieval systems using vector databases such as Pinecone, Weaviate, Milvus, FAISS, ChromaDB, Qdrant, and OpenSearch vector search.
Implement embedding pipelines using embedding models from OpenAI, Hugging Face, Sentence Transformers, or similar providers.
Develop AI pipelines for document processing, summarization, knowledge extraction, and conversational interfaces.
Deploy AI applications on AWS cloud services including Amazon Bedrock, SageMaker, EC2, Lambda, ECS, EKS, S3, DynamoDB, RDS, OpenSearch, API Gateway, and CloudWatch.
Build containerized applications using Docker and deploy them using Kubernetes, ECS, or EKS.
Implement scalable AI inference infrastructure using modern model serving technologies such as vLLM, Hugging Face TGI (Text Generation Inference), Triton Inference Server, or Ray Serve.
Build robust CI/CD pipelines and automate deployments for AI applications.
Implement observability, monitoring, and evaluation for AI systems using tools such as LangSmith, LangFuse, TruLens, Arize, Ragas, and DeepEval.
Optimize AI systems for latency, throughput, cost efficiency, and reliability in production environments.
Integrate AI applications with enterprise systems, APIs, data platforms, and external services.
Mentor engineering teams and establish best practices for building scalable AI applications and backend systems.

Required Experience

At least 2 years of hands-on experience building Generative AI or LLM-based applications in production.
min 5+ years of experience designing and developing Python applications and backend systems.
Strong experience developing REST APIs and microservices using FastAPI.
Hands-on experience integrating backend applications with AWS cloud services.
Experience building RAG pipelines, AI agents, and LLM orchestration workflows.

Required Skills

Strong programming expertise in Python.
Experience with backend frameworks such as FastAPI, Pydantic, and asynchronous programming.
Experience with Generative AI frameworks such as LangChain, LangGraph, LlamaIndex, Strands, AutoGen, CrewAI, Haystack, DSPy, or Semantic Kernel.
Experience implementing advanced RAG architectures including Graph RAG and hybrid retrieval pipelines.
Experience working with vector databases and semantic search systems.
Familiarity with machine learning and AI libraries such as PyTorch, TensorFlow, Hugging Face Transformers, Sentence Transformers, NumPy, Pandas, and Scikit-learn.
Experience deploying applications on AWS cloud infrastructure.
Experience building containerized services using Docker and deploying using Kubernetes or container orchestration platforms.
Strong understanding of scalable backend architecture, distributed systems, and cloud-native application development.

Nice to Have

Experience with model serving frameworks such as vLLM, Triton Inference Server, Ray Serve, or Hugging Face TGI.
Experience building Agentic AI workflows and autonomous AI systems.
Familiarity with AI evaluation frameworks, guardrails, and LLM safety mechanisms.
Experience building enterprise AI platforms or internal AI developer tooling.

Benefits

Comprehensive Medical Coverage: Health insurance of INR 5.0 Lakhs for you and your family (up to 6 members), ensuring complete peace of mind.
Robust Protection Plans: Group Personal Accident Insurance and Group Term Life Insurance to safeguard you and your loved ones.
Retirement Benefits: PF and Gratuity provided as per standard government regulations.
Flexible Work Options: Enjoy hybrid work arrangements & flexible working hours
Generous Leave Policy: 21 days of annual leave, in addition to 10 company-declared holidays.
Employee Well-being Spaces: Access to a dedicated break-out area with round-the-clock refreshments for relaxation and rejuvenation.

Click on Apply to know more.

This page is fully interactive when JavaScript is enabled. Please enable JavaScript to apply or browse related roles.