About the role
At Crunchbase, our dataset is a living, breathing entity, expanding daily through the collective efforts of the public, our venture program, partners, and our dedicated internal team. The Data Management team at Crunchbase takes the lead in driving initiatives that not only ensure the accuracy but also accelerate the growth of our dynamic dataset.
If you're motivated by the prospect of solving complex problems using data and AI, then we have an exciting opportunity for you! We're seeking individuals with a passion for engaging in diverse projects that span across building tools to enable AI initiatives, optimization of AI models, research, data curation, analysis, and program management.
Essential Duties/Responsibilities:
Compile and experiment with different prompts to test and optimize LLM outputs to achieve desired results
Collaborate with data scientists and engineers to create training data sets and fine-tune the AI model's behavior and responses
Define LLMs models' performance metrics and conduct regular evaluations to measure and improve model performance
Design, develop, and maintain tools and infrastructure to support and enable LLM initiatives, staying ahead of industry advancements and updating tooling as necessary to ensure cutting-edge capabilities and optimal performance
Keep abreast of industry best practices, stay informed on emerging trends, and ethical considerations in AI development
Work with external vendors to get high quality data labels
Review and fix and data quality issues within Crunchbase data
Review and audit outsource team’s work to ensure the accuracy of the data entered in Crunchbase
Required Skills/Abilities:
Excellent written and verbal communication skills
Ability to generate high-quality documentation
Ability to work independently and in a team environment
Strong analytical and problem-solving skills.
Education and Experience:
1+ years of industry experience in data labeling or other data management fields
Familiarity or experience in LLMs such as GPT
Intermediate/Advanced SQL skills, Python skills
Experience with Google Sheets
You hold a high bar on code quality and enjoy building systems and tools that can be used by others.
You are excited and interested in production systems and enabling LLMs and other MLs models.
Excellent written and verbal communication skills, problem-solving skills, and analytical skills
Ability to generate high-quality documentation
Ability to work independently and in a team environment
Physical Requirements:
Prolonged periods of sitting at a desk and working on a computer.
You may also be entitled to receive equity and benefits.
About the company
Crunchbase helps over 75 million people around the world connect with the companies and people that matter. Powered by best-in-class proprietary data, Crunchbase is democratizing access to opportunities so salespeople, entrepreneurs, investors, job seekers, and others can accelerate innovation for a better future. We’re proud to build intelligent products that shape how companies and people connect and enable them to communicate in a more meaningful way.