zorba ai
Website:
zorbaconsulting.in
Job details:
Job Title: AWS Data Engineer (Python + PySpark)
Experience: 5+ Years
Job Summary
We are seeking an experienced AWS Data Engineer with strong expertise in Python and PySpark to design, build, and optimize scalable data pipelines. The ideal candidate should have hands-on experience with AWS data services, big data processing, and distributed systems, along with a solid understanding of data modeling and ETL/ELT processes.
Key Responsibilities
Design, develop, and maintain scalable data pipelines using AWS services
Build and optimize ETL/ELT workflows using PySpark and Python
Work with large-scale datasets in distributed environments (Spark)
Develop and maintain data models for analytics and reporting
Integrate data from multiple sources (APIs, databases, streaming systems)
Ensure data quality, validation, and governance standards
Optimize performance of data pipelines and Spark jobs
Collaborate with cross-functional teams to understand business requirements
Implement CI/CD pipelines and automation for data workflows
Troubleshoot and resolve data-related issues in production
Required Skills
Strong experience with Python and PySpark
Hands-on Experience With AWS Services
S3, EMR, Glue, Lambda, Redshift (or similar)
Solid understanding of ETL/ELT processes
Experience with Spark optimization and tuning
Strong SQL skills and data modeling knowledge
Experience with data warehousing concepts
Familiarity with REST APIs and data integration
Experience with Git / version control systems
Preferred Skills
Experience with streaming technologies (Kafka / Kinesis)
Knowledge of Airflow or workflow orchestration tools
Exposure to Docker / Kubernetes
Experience with CI/CD tools (Jenkins, GitHub Actions, etc.)
Experience with data visualization tools (Tableau, Power BI)
Skills: "pyspark,python,aws
Click on Apply to know more.