Remote Data Engineering Intern

Salary

$20k - $25k

Min Experience

0 years

Location

remote

Job Type

internship

About the job

This job is sourced from a job board.

About the role

Sayari is seeking a Data Engineering Intern to join its Data Engineering team. The intern will work with the Product and Software Engineering teams to collect data globally, maintain existing ETL pipelines, and develop new pipelines for Sayari Graph.

Sayari Graph provides instant access to structured business information from billions of corporate, legal, and trade records. The application tier is built primarily in TypeScript, runs in Kubernetes, and is backed by Postgres, Cassandra, Elasticsearch, and Memgraph. The data ingest tier runs on Spark and processes terabytes of data from hundreds of sources. The platform lets users explore a large knowledge graph sourced from hundreds of millions of records in over 200 countries and 30 languages.

The intern will have the opportunity to contribute to open-source projects, including Trellis, a WebGL-powered network visualization library. This is a paid, remote internship with an expected commitment of 20-30 hours per week.

About the company

Sayari provides instant access to structured business information from billions of corporate, legal, and trade records.

Skills

Python
Scala
Git
Apache Spark
Apache Airflow
GCP
AWS
Azure
TypeScript
Elasticsearch
Kubernetes
PostgreSQL
Cassandra