Flag job

Report

Data Engineer

Salary

$0.1035k - $0.1435k

Min Experience

5 years

Location

remote

JobType

full-time

About the job

Info This job is sourced from a job board

About the role

The Data Engineer will play a crucial role in advancing the CDC Foundation's mission by designing, building, and maintaining data infrastructure for a public health organization. This role is aligned to the Workforce Acceleration Initiative (WAI). WAI is a federally funded CDC Foundation program with the goal of helping the nation's public health agencies by providing them with the technology and data experts they need to accelerate their information system improvements. Working within the Illinois Department of Public Health, to build, optimize, and manage cloud-based data pipelines and ETL processes on the Snowflake platform. This role will focus on implementing high-performance data systems, ensuring the reliability and scalability of our data infrastructure, and enabling robust analytics and insights for the organization. The ideal candidate will have hands-on experience with Snowflake, cloud environments, and data engineering best practices. The Data Engineer will be hired by the CDC Foundation and assigned to the Illinois Department of Public Health. This position is eligible for a fully remote work arrangement for U.S. based candidates. Responsibilities: - Develop a detailed plan for database migration, ETL processes, and data processing applications. - Design, build, and manage ETL/ELT processes and data pipelines on the Snowflake platform, ensuring the movement of large datasets between various data sources. Develop efficient, scalable data architectures and implement Snowflake best practices, including partitioning, clustering, and query optimization for performance and cost. Collaborate with data scientists, analysts, and Local health departments to integrate diverse data sources into Snowflake, ensuring data is available for analytics and reporting. - Monitor data pipelines and systems for performance issues, costs, errors and anomalies, and implement solutions to address them. - Collaborate with the IT Security Team to conduct security and access testing. Implement security measures to protect sensitive information. - Collaborate with cross-functional teams to understand data requirements and design scalable solutions that meet business needs. Collaborate with Systems Architect on overall system health, focusing on data aspects and data warehouse. Collaborate with Systems Architect on infrastructure assessment, focusing on data aspects. - Implement and maintain ETL processes to ensure the accuracy, completeness, and consistency of data. - Design and manage data storage systems, including relational databases, NoSQL databases, and data warehouses. - Knowledgeable about industry trends, best practices, and emerging technologies in data engineering, and incorporating the trends into the organization's data infrastructure. - Provide technical guidance to other staff. Create and maintain clear documentation for ETL processes, data pipelines, data models, and infrastructure setups. Develop training materials and conduct online sessions on accessing and utilizing shared data. - Communicate effectively with partners at all levels of the organization to gather requirements, provide updates, and present findings. - Create a data governance framework for secure and compliant data sharing. - Establish successful connection migration plan for ETL processes and APIs between migrated applications and databases. - Implement automated processes for data extraction from source systems and loading into the data warehouse. - Migrate ETL processes and APIs to the cloud environment.

About the company

The CDC Foundation helps the Centers for Disease Control and Prevention (CDC) save and improve lives by unleashing the power of collaboration between CDC, philanthropies, corporations, organizations and individuals to protect the health, safety and security of America and the world. The CDC Foundation is the go-to nonprofit authorized by Congress to mobilize philanthropic partners and private-sector resources to support CDC's critical health protection mission. Since 1995, the CDC Foundation has raised over $1.9 billion and launched more than 1,300 programs impacting a variety of health threats from chronic disease conditions including cardiovascular disease and cancer, to infectious diseases like rotavirus and HIV, to emergency responses, including COVID-19 and Ebola. The CDC Foundation managed hundreds of programs in the United States and in more than 90 countries last year.

Skills

sql
python
java
scala
hadoop
spark
kafka
flink
snowflake
data modeling
etl
data integration