Python | SQL | Databricks Developer

Min Experience

4 years

Location

Bengaluru, Karnataka, India

Job Type

Full-time

About the job

This job is sourced from a job board.

About the role

Thank you for considering the Python | SQL | Databricks Developer position at Reveal HealthTech. We are an early-stage IT startup based in the US and India, focused on leveraging technology to deliver transformative healthcare solutions.

We are looking for a skilled developer to join our growing data team. In this role, you will design, build, and optimize scalable data pipelines on the Databricks platform to support advanced analytics and machine learning initiatives. The ideal candidate brings hands-on experience with PySpark, SQL, and data transformation workflows, and thrives in a fast-paced, collaborative environment.

Find out more about our mission and why we started Reveal HealthTech on our website.

Requirements

Responsibilities

Design and implement scalable, high-performance data pipelines using Databricks and Apache Spark.
Build and maintain ETL workflows using Databricks notebooks and integrate data from diverse sources.
Develop efficient code in Python and PySpark for data processing and transformation.
Write advanced SQL queries to manipulate, analyze, and validate data.
Optimize data workflows for performance and cost-efficiency.
Collaborate with data scientists, software engineers, and stakeholders to translate business needs into technical solutions.
Implement data quality checks and validation frameworks to ensure data accuracy and reliability.
Monitor and troubleshoot pipeline performance and data issues.
Maintain clear technical documentation of workflows, codebases, and architecture.
Stay current with industry trends and best practices in big data, Databricks, and cloud data platforms.

Key Skills and Qualifications

Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent work experience).
4–5 years of experience as a Data Engineer or in a similar role.
Strong hands-on experience with Databricks and Apache Spark.
Proficiency in PySpark and Python for data engineering workflows.
Solid understanding of data lake, ETL design patterns, and distributed data processing.
Experience with SQL for data querying and transformation.
Familiarity with version control systems like Git and participation in code reviews.
Understanding of cloud platforms such as AWS, Azure, or GCP (AWS preferred).
Experience in building and optimizing data pipelines that feed machine learning models and analytics.
Ability to work independently and collaboratively with cross-functional teams.
Strong analytical and debugging skills.

About the company

We are an early-stage IT startup based in the US and India, focused on leveraging technology to deliver transformative healthcare solutions.

Skills

Python
SQL
Databricks