Generative AI Engineer

Speed Engineering Solutions

Location: Bengaluru, Karnataka, India
Job type: Full-time

Required skills

LangChain
AWS
Azure
CloudFormation
CloudWatch
CodePipeline
containerization
data modeling
Datadog
DevOps
Docker
Elasticsearch
FastAPI
Flask
GCP
GitHub
GraphQL
Kubernetes
microservices
OAuth2
Terraform
VPC
RESTful

About the role

Speed Engineering Solutions

Website: speed-global.com
Job details:

Key Responsibilities :

Architect and build GenAI-powered applications using LLMs and advanced RAG pipelines for technical documentation, maintenance, and support workflows.
Design and implement retrieval-augmented generation (RAG) systems leveraging vector databases and semantic search for contextual information retrieval.
Integrate and fine-tune LLMs (OpenAI, Anthropic, Amazon Bedrock, HuggingFace, etc.) for domain-specific tasks and continuous improvement.
Develop robust evaluation frameworks for LLM output (relevancy, faithfulness, summarization, contextual accuracy).
Build and deploy microservices and APIs for GenAI solutions, ensuring high availability, scalability, and secure access.
Collaborate with product, domain, and platform teams to refine requirements and deliver innovative GenAI solutions.
Stay current with GenAI trends, frameworks, and best practices to future-proof solutions for evolving enterprise needs.

Skill and Experience Required:

Experience Range : 4-8 Years
Qualification : Bachelor’s or Master’s Degree in Computer Science or any other B.E. / B.Tech.
GenAI & LLMs: OpenAI, Anthropic, Amazon Bedrock, HuggingFace Transformers, LangChain, RAG architectures, prompt engineering, fine-tuning.
Vector Databases & Semantic Search: Pinecone, FAISS, Milvus, Weaviate, Elasticsearch/OpenSearch with vector support.
Experience in Graph Database like Neo04/ Amazon Neptune is a must.
GenAI Evaluation & Monitoring: G-Eval, Prometheus, QAG scorer, MLflow, DataDog, CloudWatch, custom LLM-as-a-judge frameworks.
API & Microservices: RESTful API design, GraphQL, FastAPI, Flask, OAuth2/JWT, containerization (Docker, Kubernetes).
Security & Compliance: IAM, KMS, VPC endpoints, S3 Object Lock, CloudTrail, encryption policies, GDPR/industry compliance.
DevOps for GenAI: CI/CD pipelines (GitHub Actions, CodePipeline), Terraform, AWS CDK, CloudFormation for automated deployment and scaling of GenAI services.
Data Modeling for GenAI: Metadata tagging, schema design, data quality management for LLM training and inference.
Multi-modal Retrieval: Experience with text, images, video, telemetry for comprehensive GenAI solutions.
Building enterprise-grade RAG systems for technical documentation, support, or maintenance workflows.
Implementing multi-modal retrieval in large-scale environments.
Familiarity with continuous evaluation and improvement of LLMs in production (A/B testing, feedback loops).
Exposure to regulated industries (aviation, manufacturing, healthcare) with strict compliance and audit requirements.
Working with multi-cloud architectures (AWS, Azure, GCP) and hybrid deployments.
Advanced knowledge of MLOps: model deployment, monitoring, rollback, and lifecycle management.

Looking for Immediate Joiners only. Its totally WFO opportunity for our MNC Client at Bengaluru.

Location: Bangalore, Karnataka

Click on Apply to know more.

This page is fully interactive when JavaScript is enabled. Please enable JavaScript to apply or browse related roles.