Website: altiorainfotech.com
Job details:
What You’ll Actually Do
• Build and deploy advanced AI-powered products end-to-end (frontend → backend → AI systems → cloud deployment)
• Design, fine-tune, integrate, and optimize AI/LLM models for real-world production use cases
• Build and manage MCP servers, AI agent infrastructure, tool-calling pipelines, and multi-agent systems
• Work deeply with chunking strategies, embeddings, vector databases, RAG pipelines, and long-context memory systems
• Develop scalable AI infrastructure capable of handling real-time inference, automation, and autonomous workflows
• Create intelligent retrieval systems using semantic search, hybrid search, reranking, and contextual memory pipelines
• Use AI tools (Cursor, Copilot, ChatGPT, Claude, OpenRouter, etc.) as force multipliers—not shortcuts
• Rapidly prototype, iterate, optimize, and ship AI features without sacrificing architecture quality
• Optimize prompts, token usage, inference speed, latency, and infrastructure costs
• Work directly with founders and product teams to transform ideas into production-grade AI systems
• Research and implement cutting-edge AI capabilities before they become mainstream
What Makes You a Fit
• You’ve built real AI products—not just cloned tutorials or basic chatbot demos
• You understand AI systems deeply, including models, embeddings, chunking, RAG, MCP, agents, memory, and orchestration
• You have hands-on experience creating or managing MCP servers and AI infrastructure pipelines
• You know how to structure, chunk, index, retrieve, and optimize large-scale data for LLMs
• You can architect scalable AI systems that remain maintainable as complexity grows
• You already use AI daily in development and know how to maximize productivity with it
• You think like a builder and take ownership from concept to deployment
• You move fast, learn fast, and solve problems independently
• You care equally about performance, scalability, clean architecture, and user experience
• You’re comfortable navigating ambiguity and building things that have never been built before
Tech Stack
• AI/LLMs: OpenAI, Claude, Gemini, OpenRouter, Hugging Face, Ollama, vLLM
• AI Frameworks: LangChain, LangGraph, CrewAI, AutoGen, MCP SDKs
• RAG & Retrieval: Pinecone, Weaviate, Qdrant, ChromaDB, FAISS, semantic search systems
• Backend: Node.js / Python / FastAPI
• Frontend: React / Next.js
• Databases: PostgreSQL / MongoDB / Redis / vector databases
• Cloud & Infra: AWS / GCP / Docker / Kubernetes
• Realtime & Voice: WebSockets, Twilio, LiveKit, AI voice pipelines
• Dev Workflow: Cursor, Copilot, CLI tooling, AI-assisted engineering workflows