Flag job

Report

Internship

Salary

$30k - $50k

Min Experience

0 years

Location

san francisco, remote

JobType

internship

About the job

Info This job is sourced from a job board

About the role

Cleanlab offers solutions that easily reduce time/cost to turn unreliable data into reliable models and analytics. We provide Data-Centric AI software to automatically find and fix issues in your datasets and assess quality so you can trust your data and the solutions you build on your data. We hire a select few interns every fall/spring/summer, working on various applied Machine Learning and Data Science projects. As a small startup, we can only take interns capable of completing high-value projects independently. This means you must at least be in your junior year of college (graduate students preferred). Interns must have extremely strong Python skills and collaborative software development experience (Github, CI/CD, etc), as well as extensive coursework in Data Science, Analytics, or Machine Learning. Please link to your favorite project (Github, blogpost, etc), ideally something that showcases your: writing skills, ability to produce graphics/visualizations, coding skills, Data Science knowledge, creativity, and overall desire to use AI to solve impactful problems. Internships may be fully remote, but if you prefer to work in an office, we have a nice one in San Francisco.

About the company

Prior to Cleanlab, our founders (3 ML PhDs from MIT) worked at OpenAI, Google, Microsoft, Amazon, AWS, Facebook AI Research (FAIR), Dropbox, Oculus, Palantir, NASA, General Electric, MIT Lincoln Laboratory, MIT, Harvard, and Stanford – at every place we worked we repeatedly encountered the same issue – AI solutions failed to work reliably on real-world, human-centric data due to label errors and poor data quality. So, we spent eight years of PhD research at MIT inventing a new field to solve this problem and after successful pilots with world-leading organizations, Cleanlab emerged. Everything we do at Cleanlab is guided by our north star – to improve the world's ML data more easily and quicker than any other solution – enabling AI systems to train more reliably on real-world, messy, error-prone data. We develop next-generation data-centric AI, open-source algorithms and provide no-code SaaS enterprise solutions to help individuals and teams at companies (across all industries) diagnose/fix issues in their datasets and produce more reliable ML models by providing clean labels for training. While many companies can help store/manage data or develop ML models, there exist few solutions today to improve the quality of existing data, which is the core asset of the modern enterprise. This is where you come in. At Cleanlab, you'll be able to take ownership of critical projects that pioneer the future of data-centric AI.

Skills

python
data science
machine learning
data analytics
software development