Flag job

Report

Python Developer with Python,Pyspark and AWS

Location

Hyderabad, Telangana, India

JobType

full-time

About the job

Info This job is sourced from a job board

About the role

CGI

Website: cgi.com
Job details:
Founded in 1976, CGI is among the largest independent IT and business consulting services firms in the world. With 94,000 consultants and professionals across the globe, CGI delivers an end-to-end portfolio of capabilities, from strategic IT and business consulting to systems integration, managed IT and business process services and intellectual property solutions. CGI works with clients through a local relationship model complemented by a global delivery network that helps clients digitally transform their organizations and accelerate results. CGI Fiscal 2024 reported revenue is CA$14.68 billion and CGI shares are listed on the TSX (GIB.A) and the NYSE (GIB). Learn more at cgi.com.

Job Title:

Python Developer with Python, Pyspark and AWS

Position: -Software Engineer

Experience: 5- 8 Years

Category: Software Development/ Engineering

Shift: General

Main location: Hyderabad/Bangalore/Chennai

Position ID: J0326-2285

Employment Type: Full Time

We are seeking a skilled Python Developer with strong expertise in PySpark and AWS to design, develop, and optimize scalable data processing solutions. The candidate will be responsible for building high-performance data pipelines, working with large datasets, and contributing to cloud-based data platforms.

Your future duties and responsibilities

Design, develop, and maintain scalable data pipelines using Python and PySpark

Process and analyze large datasets in distributed computing environments

Develop ETL workflows and data transformation logic

Work with AWS services such as S3, Glue, Lambda, and EMR

Optimize data processing jobs for performance and cost efficiency

Collaborate with data engineers, analysts, and business stakeholders

Ensure data quality, integrity, and security across systems

Troubleshoot and resolve data-related issues in production environments

Participate in code reviews and follow best practices in development

Required Qualifications To Be Successful In This Role

Strong proficiency in Python programming

Hands-on experience with PySpark and Apache Spark

Experience working with AWS cloud services (S3, EC2, Glue, EMR, Lambda)

Solid understanding of ETL processes and data warehousing concepts

Experience with SQL and relational databases

Knowledge of distributed data processing and big data technologies

Familiarity with version control tools like Git

Good problem-solving and analytical skills

Preferred Qualifications

Experience with Airflow or other workflow orchestration tools

Knowledge of data lake architecture and data modeling

Experience with CI/CD pipelines

Familiarity with Docker and containerization

AWS certification is a plus

Skills

Python

PySpark

AWS (S3, Glue, EMR, Lambda)

SQL

Data Engineering

Big Data Processing

Together, as owners, let’s turn meaningful insights into action.

Life at CGI is rooted in ownership, teamwork, respect and belonging. Here, you’ll reach your full potential because…

You are invited to be an owner from day 1 as we work together to bring our Dream to life. That’s why we call ourselves CGI Partners rather than employees. We benefit from our collective success and actively shape our company’s strategy and direction.

Your work creates value. You’ll develop innovative solutions and build relationships with teammates and clients while accessing global capabilities to scale your ideas, embrace new opportunities, and benefit from expansive industry and technology expertise.

You’ll shape your career by joining a company built to grow and last. You’ll be supported by leaders who care about your health and well-being and provide you with opportunities to deepen your skills and broaden your horizons.

Come join our team—one of the largest IT and business consulting services firms in the world.

Click on Apply to know more.

Skills

Python
Airflow
AWS
Apache
Apache Spark
big data technologies
CGI
containerization
data lake
data modeling
Docker
EC2
end-to-end
ETL
intellectual property
Lambda
SQL
version control