Flag job

Report

Data Engineer Intern

Min Experience

4 years

Location

Delhi

JobType

internship

About the job

Info This job is sourced from a job board

About the role

TensorStax is developing the next generation of autonomous data engineering agents. The Role As a Data Engineer, you will design and develop production-grade pipelines that our agents learn from and eventually operate. You will be responsible for the modeling layer in dbt, orchestration layer in Airflow, and heavy-lift workloads in Spark. Key Responsibilities: Design complex, interdependent schemas in dbt across hundreds of tables Build advanced, multi-branch Airflow DAGs with sophisticated dependency and failure handling Author high-performance Spark jobs (PySpark or Scala) for large-scale batch and incremental workloads Codeify lineage, testing, and metadata so agents can reason about pipeline state Profile and tune query performance across warehouses and lakehouse engines Partner with the agent research team to expose realistic failure modes, data drifts, and SLA violations for RL training Containerize and deploy everything on Kubernetes-backed infrastructure

About the company

TensorStax is developing the next generation of autonomous data engineering agents.

Skills

sql
java
spark
dbt
airflow
python
scala