About the Company
Growth Protocol is the Enterprise Reasoning Platform, based in New York, USA. We specialize in growth intelligence—combining a neuro-symbolic reasoning core, seamless data activation, and agentic workflows. We’re committed to accelerating growth, reducing execution drag, and delivering measurable business outcomes. Our mission is to turn strategy into action so enterprises move ahead of the curve.
Job Description
We are seeking an ambitious individual who welcomes the challenge of meeting the needs of a rapidly growing startup. As a Sr. Data Engineer, you will be at the heart of Growth Protocol’s data infrastructure, playing a foundational role in building the systems that power our AI platform. Your work will have a direct impact on product features, client outcomes, and strategic business decisions.
You will collaborate with Data Scientists, Backend Engineers, and business stakeholders to build and maintain scalable pipelines that serve billions of rows of structured and unstructured data weekly, enabling high-impact insights across multiple industries.
Responsibilities
Collaboration
- Work closely with Data Scientists to translate business and ML requirements into robust data workflows.
- Ensure timely delivery of clean, reliable data to support model development and production features.
Technical Development
- Engineer and manage scalable ETL architecture using Apache Beam, Snowpark, Cloud Run and Airflow.
- Skilled in data extraction from diverse online platforms.
- Design and implement a high-performance data infrastructure for seamless processing and integration.
- Operationalize machine learning models, focusing on deployment, reliability, and performance.
Data Connectivity & Solutions
- Client Onboarding: Partner with client IT teams to identify the most efficient and secure methods for data ingestion (e.g. Snowflake Sharing, Databricks deltasharing, Private Link, or VPN).
- Work alongside our Platform Engineering team to define the requirements for secure networking paths, ensuring the infrastructure supports high-performance data transfers.
- Perform the end-to-end testing of client connections to ensure data integrity and connectivity.
- Integrate customer databases with our platform
Monitoring & Reliability
- Create and manage real-time monitoring systems for data ingestion and transformation pipelines.
- Proactively identify and resolve issues to maintain high levels of system reliability and data integrity.
Minimum Qualifications
- 5+ years of experience in Data Engineering
- Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
- Experience building data pipelines with robust unit and integration testing
- Proficiency in distributed computing frameworks (e.g., Apache Beam, Spark)
- Functional understanding of enterprise networking (VPC peering, Private Link, VPNs) and the ability to troubleshoot connectivity in a cloud environment.
- Hands-on experience operationalizing ML models in production
- Familiarity with ML/AI, NLP, and Data Science workflows including MLflow
- Deep understanding of ETL workflows, data modeling, and data architecture
- Strong debugging and problem-solving skills
- Excellent communication skills and experience collaborating across teams
Preferred Qualifications
- Experience working on enterprise products serving Fortune 500 clients across Financial Services, Industrial Manufacturing, and Consumer Products
- Prior startup experience
- Interest in current events, market dynamics, and emerging technologies
- Experience creating Agent Skills
- Familiarity with using API and webscraping to collect data
- Familiarity with Graph Databases
Tech Stack
- Languages: Python, TypeScript
- Frameworks: Apache Beam, Spark, FastAPI, Airflow
- Cloud: Google Cloud Platform (GCP), AWS, Azure
- Data: Snowflake, Databricks, PostgreSQL, Elasticsearch, MongoDB, GCS, Neo4j
- Infrastructure & DevOps: Docker, Terraform, GitHub Actions, Cloud Run
- Frontend: Next.js
Perks
- Competitive compensation and equity in a rapidly growing company
- 100% Company Paid Health, Dental, and Vision Insurance