Tech Lead/ Staff Engineer- Agentic ai

Prismforce

full-time

Required skills

API
architectural direction
backend
caching
capacity planning
clustering
database
Docker
end-to-end
enterprise SaaS
infrastructure-as-code
JS
Kafka
Kubernetes
load balancing
multi-tenant
Node
RabbitMQ
Redis
SaaS
theorem
threads
TypeScript

About the role

Prismforce

Website: prismforce.com
Job details:
SSDE / Staff Engineer – Backend Platform

Who are We?

Prismforce is a Vertical SaaS company transforming the Talent Supply Chain for global Technology, Engineering, and IT Services organizations. Our AI-powered platform enables enterprises to improve operational agility, accelerate decisions, and increase profitability through intelligent workforce and talent infrastructure.

We are building large-scale, AI-native SaaS systems that operate across geographies, languages, tenants, and enterprise-scale workloads.

Job Title

Staff Engineer – Backend Platform

Experience

8–14 Years

Location

Bangalore / Pune

Employment Type

Full-time

Stack

Node.js

TypeScript
Distributed Systems
Multi-Tenant SaaS

About The Role

We are looking for a highly experienced Staff Engineer to lead the architecture and engineering of high-throughput, high-availability, multi-tenant SaaS platforms operating at global scale.

This role is for engineers who have designed and owned mission-critical backend systems handling large-scale concurrency, geo-distributed traffic, multilingual workloads, and enterprise-grade reliability requirements.

You will drive architecture decisions, platform scalability, database strategy, observability standards, and performance optimization across distributed systems. You are expected to independently own applications end-to-end — from architecture and design reviews to production reliability and engineering excellence.

This is a deeply technical leadership role. We are looking for hands-on engineers who can solve hard systems problems, guide architectural direction, and elevate engineering quality across teams.

What We’re Looking For — Priority Order

PrioritySkill AreaExpectationMust HaveDistributed Systems & Platform ArchitectureProven ownership of scalable, highly available SaaS systemsMust HaveCS Fundamentals & Runtime InternalsStrong expertise in concurrency, memory, CPU profiling, runtime behaviorMust HaveDatabase Design & PerformanceAdvanced schema design, indexing, partitioning, query optimizationMust HaveMulti-Tenant SaaS ArchitectureTenant isolation, scalability, geo-distribution, multilingual systemsStrong Working KnowledgeReliability & ObservabilityProduction monitoring, tracing, incident handling, SLO-driven systemsGood to HaveAI/LLM InfrastructureExposure to AI-native backend systems or agentic architectures

What You’ll Do

Architect and own high-throughput, high-availability backend platforms serving enterprise SaaS workloads.
Design globally scalable multi-tenant and multi-region systems with strong reliability guarantees.
Drive application ownership end-to-end — architecture, implementation, deployment, scalability, monitoring, and production operations.
Build resilient distributed systems with asynchronous event-driven architectures.
Lead platform-level decisions around scalability, caching, throughput optimization, resiliency, and fault tolerance.
Diagnose and resolve CPU bottlenecks, memory leaks, event loop blocking, connection exhaustion, and distributed system performance issues.
Define database architecture standards including indexing strategies, partitioning, sharding, replication, schema evolution, and query optimization.
Drive engineering excellence across observability, testing, logging, tracing, CI/CD, and operational readiness.
Lead high-level system design discussions and architectural reviews across teams.
Mentor engineers on system thinking, debugging, scalability, and production-grade engineering practices.
Collaborate with product and platform teams to design scalable multilingual and geo-distributed SaaS applications.
Influence long-term platform strategy and technical direction.

Core Technical Expectations

Distributed Systems & Scalable SaaS Architecture ★ Bar-Raising

This is the primary evaluation area for the role.

We are looking for engineers who have built and operated systems at scale — not just contributed features.

Expected Expertise

High-throughput distributed systems design
High-availability and fault-tolerant architectures
Multi-region and geo-distributed deployments
Multi-tenant SaaS architecture patterns
Event-driven systems and asynchronous processing
Horizontal scaling strategies
Load balancing and traffic distribution
CAP theorem and consistency trade-offs
Resilience engineering and graceful degradation
Circuit breakers, retries, backpressure handling
Distributed caching and eventual consistency
API gateway and service mesh fundamentals
Queueing systems: Kafka, RabbitMQ, or equivalent

Node.js Runtime & Systems Engineering ★ Bar-Raising

Deep runtime understanding is mandatory.

Expected Expertise

Node.js internals: event loop, libuv, V8 heap architecture
Worker threads, clustering, async execution models
Event loop blocking diagnosis
CPU and memory profiling in production systems
Heap snapshots, garbage collection analysis
Resource leak debugging
Process lifecycle and OS-level system behavior
Socket management and connection pooling
Performance optimization for long-running services
TypeScript at scale with strong type-system understanding

Database Design & Performance Engineering ★ Bar-Raising

We expect strong database architects — not ORM-only developers.

Expected Expertise

Advanced relational database design
Query planning and execution analysis
Indexing strategies:

B-tree
Composite indexes
Partial indexes
Covering indexes

Database partitioning and sharding
Multi-tenant data isolation models
Read/write scaling strategies
Replication and failover models
Schema migrations and zero-downtime deployments
MongoDB schema optimization and aggregation pipelines
Redis caching patterns and cache invalidation strategies
Data consistency and transaction management
Performance tuning under large-scale workloads

High-Level Design & Architecture

You should be comfortable driving architecture for large-scale enterprise platforms.

Expected Expertise

System decomposition and domain-driven design
Service boundaries and communication patterns
Scalability estimation and capacity planning
Trade-off analysis across architecture choices
Design documentation and RFC creation
Production readiness reviews
Reliability-first engineering mindset
Security and tenant isolation awareness
Designing multilingual SaaS platforms
Designing geo-aware and region-aware systems

Observability & Production Excellence

Expected Expertise

Distributed tracing and observability
Prometheus, Grafana, OpenTelemetry
Structured logging and correlation IDs
Incident debugging and root-cause analysis
Defining meaningful SLIs/SLOs
On-call ownership and operational maturity
CI/CD pipelines and deployment strategies
Docker and Kubernetes production deployments

Nice to Have

AI/LLM platform integration experience
Experience with agentic workflows or orchestration systems
Vector databases and semantic retrieval systems
Real-time transcript or speech-processing systems
Infrastructure-as-code exposure
Open-source contributions or technical writing
Prior experience in enterprise SaaS or platform engineering organizations

The Kind of Engineer We’re Looking For

You think in systems, trade-offs, and long-term scalability.
You naturally take ownership beyond your immediate module.
You can simplify complexity without oversimplifying the problem.
You are comfortable operating in ambiguity and high-growth environments.
You care deeply about performance, reliability, and maintainability.
You elevate engineering standards across teams through both execution and mentorship.
You balance pragmatism with architectural rigor.
You build platforms that other engineers love to work on.

Click on Apply to know more.

This page is fully interactive when JavaScript is enabled. Please enable JavaScript to apply or browse related roles.