Flag job

Report

Data Engineer

Min Experience

3 years

Location

Bengaluru

JobType

full-time

About the job

Info This job is sourced from a job board

About the role

Job Role: Data Engineer

Location: Bangalore

Educational Background: Bachelor’s or master’s degree in computer science, Engineering, or a related field.

Experience: 5 years of experience in Data Engineering preferably in manufacturing industry
and logistics, transportation process areas, with proven expertise in delivering analytics
solutions. Prior consulting experience is a plus.
Technical Expertise:
• Proficient understanding of distributed computing principles.
• Data engineering implementation experience with Big Data technologies in cloud and on-
prem environments.
• Proficiency with big data technologies managing large data sets, databases, data lakes and
cloud-based storage.
• Expertise in managing and optimizing Spark clusters, along with other implementations of
Spark.
• Strong programming skills in Python, Py-spark
• Strong proficiency in SQL and experience with relational databases (PostgreSQL, MySQL,
Oracle, etc.) and NoSQL databases (MongoDB, Cassandra, DynamoDB).
• Experienced on the software development using services such as AWS Glue, S3, Lambda,
Spark in hyperscaler AWS.
• Knowledge of data modeling techniques such as star/snowflake, data vault etc.,
• Knowledge of semantic modeling
• Familiarity with BI Tools such as Power BI, Oracle Analytics Cloud, etc.,
• Strong problem-solving skills - Be able to hone business acumen with a capacity for
straddling between macro business strategy to micro tangible data and AI products.
• Database: AWS S3 (Landing, Bronze, Silver, Gold)
• Data Transformation: Python script
• Data Read/Write to Medallion Architecture: PySpark script
• Application DB: Postgres
• Scope: No Data Science work; primarily data cuts and variance analysis
• Skills: Strong understanding of the AWS ecosystem
• Proficiency in Python/PySpark with a good grasp of object-based structures

Skills

Python
PySpark
PostgreSQL
MySQL
MongoDB
Cassandra
Dynamodb
AWS
Data Modeling