Gloify
Website:
gloify.com
Job details:
Job Summary:
We are looking for a skilled Python Developer with strong PySpark experience to design, develop, and optimize scalable data processing solutions. The ideal candidate should have hands-on experience working with large datasets, distributed computing, and data pipelines in a big data environment.
Key Responsibilities:
- Develop and maintain data pipelines using Python and PySpark.
- Work with large-scale datasets and implement ETL processes.
- Optimize Spark jobs for performance and scalability.
- Collaborate with data engineers, data scientists, and cross-functional teams to deliver data solutions.
- Design and implement data transformation and processing workflows.
- Ensure data quality, integrity, and reliability across pipelines.
- Troubleshoot and resolve data processing issues and performance bottlenecks.
- Participate in code reviews and follow best development practices.
Required Skills:
- Strong experience in Python programming.
- Should have 5+ years of experience in similar role
- Hands-on experience with PySpark and Apache Spark.
- Experience working with large datasets and distributed data processing.
- Strong knowledge of SQL and database concepts.
- Experience with ETL pipelines and data processing frameworks.
- Familiarity with Hadoop ecosystem (HDFS, Hive, etc.).
- Understanding of data structures and algorithms.
Click on Apply to know more.