PYXIDIA TECHLAB
Website:
pyxidiatech.com
Job details:
Position: Senior Data Analyst
Location: Mumbai / Bangalore
Join Us – Innovating Healthcare with AI and Data
We are transforming the U.S. healthcare revenue cycle market by combining cutting-edge AI with human expertise. Since 2011, we have grown rapidly, partnering with over 500 healthcare providers, including leading hospitals and diagnostic labs. Our Mumbai and Bangalore offices build the technology that powers these solutions.
About the Role
We are looking for a Senior Data Analyst who will play a critical role in designing, building, and scaling data solutions across products and client implementations. In this role, you will work closely with Product, Engineering, Data Science, and client-facing teams to translate complex business and healthcare use cases into scalable data models and pipelines.
You will also mentor junior analysts, influence data design decisions, and ensure data quality, reliability, and performance at scale.
Role & Responsibilities
- Own the design, development, and operation of scalable data pipelines across structured and unstructured data sources
- Build and optimize ETL/ELT workflows using SQL, Python, PySpark, and cloud-based big data technologies
- Define and enforce data quality, validation, and monitoring standards to ensure accuracy and trust in data
- Perform advanced data exploration and analysis to solve ambiguous business problems and deliver actionable insights
- Act as a senior data owner during product and client implementations, ensuring data use cases are correctly implemented, tested, and production-ready
- Collaborate with Product, Engineering, and Data Science teams on data architecture and analytical use cases
- Provide technical leadership and mentorship to junior analysts and contribute to data platform best practices
Required Skills & Experience
- 4+ years of experience in Data Analytics or Data Engineering roles
- Strong expertise in SQL and hands-on experience with relational and NoSQL databases
- Advanced experience with Python and PySpark for large-scale data processing
- Proven experience building and managing ETL pipelines using tools such as Airflow, AWS Glue, or similar
- Strong understanding of data modeling and performance optimization
- Experience working with cloud data services (S3, EMR, Redshift, Athena, RDS, etc.)
- Ability to own complex data problems end-to-end and influence cross-functional stakeholders
- Exposure to U.S. healthcare data is a strong plus
Click on Apply to know more.