zimetrics
Website: zimetrics.com
Job details:
Job Title: Data Engineer (Python, PySpark, SQL)
Experience: 4-7 Years
Location: Pune / Hybrid
Role Overview
We are looking for a skilled Data Engineer to design, build, and optimize scalable data pipelines. The ideal candidate has strong hands-on experience in Python, PySpark, and SQL, along with a good understanding of data processing frameworks and large-scale data systems.
Key Responsibilities
- Design, develop, and maintain scalable data pipelines using Python and PySpark
- Work with large datasets to perform data transformation, cleansing, and aggregation
- Write complex, optimized SQL queries for data extraction and analysis
- Collaborate with data analysts, data scientists, and business teams to understand data requirements
- Ensure data quality, integrity, and reliability across systems
- Optimize performance of data workflows and ETL processes
- Work with cloud platforms (AWS/Azure/GCP) for data storage and processing (preferred)
- Monitor and troubleshoot data pipeline issues
Required Skills
- Strong experience in Python programming
- Hands-on experience with PySpark / Apache Spark
- Expertise in SQL (joins, subqueries, performance tuning)
- Experience with ETL processes and data pipeline development
- Understanding of data warehousing concepts
- Familiarity with distributed data processing
Good to Have
- Experience with cloud platforms like AWS, Azure, or GCP
- Knowledge of tools like Airflow, Kafka, or Hive
- Experience with big data technologies and data lakes
- Basic understanding of data modeling
Soft Skills
- Strong problem-solving and analytical skills
- Good communication and collaboration abilities
- Ability to work in a fast-paced environment