AIML Engineer

InfoVision Inc.

full-time

Required skills

Airflow
AWS
Azure
compliance
cross-functional
data ingestion
data science
deep learning
Docker
end-to-end
ETL
GCP
GPU
Kubeflow
Kubernetes
NLP
Serverless
SQL
TensorFlow
Terraform
Pytorch
Vertex

About the role

InfoVision Inc.

Website: infovision.com
Job details:

We are seeking a highly experienced ML/AI Architect to design, build, and scale enterprise-grade AI/ML and GenAI solutions within our Digital Transformation Practice. The role requires strong technical leadership across ML system architecture, LLM-based solution design, MLOps automation, and cloud-native data engineering. The ideal candidate will drive innovation, establish best practices, and deliver production-ready AI solutions at scale

Key Skills & Competencies

LLMs / GenAI: GPT-4/5, Llama 3/3.5, Qwen, Claude, RAG, prompt engineering, LoRA/QLoRA, embeddings
ML / DL: PyTorch, TensorFlow, Transformers, CNNs, XGBoost, NLP/NLU, NER
MLOps: MLflow, Airflow, Kubeflow, SageMaker, Vertex AI, Azure ML, CI/CD/CT, monitoring
Data Engineering: PySpark, SQL, ETL/ELT pipelines, orchestration, data quality
Cloud & DevOps: AWS, Azure, GCP, Kubernetes, Docker, Terraform, GPU stack
Other: Vector DBs (Pinecone, Weaviate, Chroma, Milvus), observability, model optimization, model explainability (SHAP/LIME)

Roles & Responsibilities

Architect end-to-end ML systems covering data ingestion, feature engineering, model training, deployment, and monitoring.
Design scalable ML pipelines for classical ML, deep learning, and GenAI workloads including LLMs, RAG, embeddings, and semantic search.
Select optimal algorithms, frameworks, and cloud platforms to address complex business and technical challenges.
Implement model optimization strategies such as quantization, pruning, distillation, and LoRA/QLoRA fine-tuning.
Build LLM-powered solutions using prompt engineering, knowledge grounding, agents, and retrieval-augmented generation.
Work with top LLMs (GPT-4/5, Llama 3/3.5, Qwen, Claude, Gemini) and integrate vector DBs like Pinecone, Weaviate, Chroma, Milvus.
Evaluate and improve LLM and ML model performance, reduce hallucinations, and ensure security and compliance.
Lead development of ML, NLP, NER, and deep learning models using Transformers, CNNs, XGBoost, and other advanced architectures.
Drive data science workflows including EDA, feature engineering, statistical modeling, and explainability (SHAP/LIME).
Architect MLOps pipelines (CI/CD/CT), enabling model versioning, monitoring, drift detection, and automated retraining.
Deploy models at scale using Kubernetes, Docker, Terraform, serverless components, and GPU-based environments with strong observability.
Collaborate with cross-functional teams, lead technical discussions and POCs, and mentor ML/AI teams to deliver enterprise-grade solutions.

Click on Apply to know more.

This page is fully interactive when JavaScript is enabled. Please enable JavaScript to apply or browse related roles.