Website:
yminds.ai
Job details:
About the Role
Our client is seeking a highly experienced Senior Data Engineer (3-15 years) to join their Data Platform team. This role focuses on designing, building, and optimizing large-scale data infrastructure, pipelines, and cloud-based data platforms.
The ideal candidate will work across the full data lifecycle—enabling seamless data integration, scalable processing, and advanced analytics—while supporting cross-functional teams and ensuring high-performance, reliable data systems for business-critical applications.
Location : Gurugram and Chennai
Key Responsibilities
- Design, deploy, configure, and manage multi-node big data clusters across Dev, Test, and Production environments
- Build and maintain scalable data pipelines, ETL/ELT workflows, and data lake architectures
- Integrate data from multiple sources ensuring data quality, reliability, and accessibility
- Develop automation scripts using Python and Shell scripting to streamline infrastructure and operations
- Work with distributed data processing frameworks such as Apache Spark and Hadoop ecosystem
- Design and maintain data models (dimensional modeling, star/snowflake schemas) for analytics and reporting
- Develop and support BI solutions, dashboards, and KPIs for business insights
- Optimize platform performance (scalability, latency, availability) and resolve system issues
- Monitor, debug, and maintain production data systems and pipelines
- Build tools and frameworks for ETL monitoring, validation, and troubleshooting
- Collaborate with cross-functional teams including developers, analysts, and business stakeholders
- Support production environments and on-call operations
- Perform code reviews, data validation, and QA checks
- Continuously evaluate and adopt new tools and technologies to improve data platform capabilities
Required Skills & Experience
- 3-15 years of experience in Data Engineering / Big Data environments
- Strong expertise in Data Lake technologies:
- Apache Spark, Hadoop, Yarn, Distributed File Systems
- Experience with Cloud Platforms:
- GCP (BigQuery, Dataflow, Cloud Storage) or AWS (Redshift, EMR)
- Strong proficiency in SQL (Vertica, Dremio, or similar big data SQL engines)
- Hands-on experience with Python and Shell scripting
- Experience building ETL pipelines and OLAP systems
- Experience with workflow orchestration tools (e.g., Apache Airflow)
- Strong understanding of data warehousing concepts
- Experience in data modeling for analytics and reporting
- Strong debugging, monitoring, and troubleshooting skills
- Familiarity with version control systems and relational databases
- Experience working in Linux-based environments
- Bachelor’s/Master’s degree in Computer Science or related field
Nice-to-Have Skills
- Experience with Visualization tools (Apache Superset, Tableau)
- Exposure to AWS EMR and advanced cloud services
- Strong understanding of data structures and analytics workflows
- Experience working in Agile environments
- Experience with on-call production support
- Strong communication and system design skills
About YMinds.AI
YMinds.AI is a talent and consulting partner connecting top-tier tech professionals with high-growth organizations. We specialize in building high-performing engineering teams by aligning the right talent with the right opportunities, ensuring long-term success for both clients and candidates.
Keywords
Senior Data Engineer, Big Data, Data Lake, Apache Spark, Hadoop, Yarn, GCP, AWS, SQL, Python, ETL, OLAP, Data Modeling, BI, Data Pipelines, Airflow, BigQuery, Redshift
Hashtags
#DataEngineering #SeniorDataEngineer #BigData #ApacheSpark #Hadoop #GCP #AWS #Python #SQL #Hiring #ChennaiJobs #TechJobs #DataJobs
Click on Apply to know more.