Quikr
Website: quikr.com
Job details:
Role: Data Engineer
Experience: 2–3 years of hands-on experience in data engineering or large-scale data extraction
Location: Noida Sector 62 (onsite role), working hours 10 AM to 7 PM
Role and Key Responsibilities:
About the Company
We are building an AI-driven travel platform designed to redefine how people plan their journeys. Our mission is to create a self-learning machine learning system capable of generating intelligent, hyper-personalized plans using rich global tourism data. Our platform combines large-scale structured and unstructured travel datasets with advanced AI to deliver smart recommendations tailored to individual preferences.
About the Role
We are looking for a skilled Data Engineer who will focus primarily on large-scale travel data scraping and refinement. Your core responsibility will be to collect deep travel-related data and transform it into structured, AI-ready datasets that power our machine learning models.
Requirements
• Bachelor’s degree in computer science, engineering, or a related field.
• 2–3 years of hands-on experience in data engineering or large-scale data extraction.
• Strong Python (Pandas, NumPy).
• Advanced SQL/MySQL (query optimization, indexing).
• Experience with Scrapy, Beautiful Soup, or Selenium.
• Handling large datasets (millions of records).
• Data cleaning, transformation, and validation.
• Experience with APIs (Google Places, OpenStreetMap).
• Basic Bash scripting and automation.
• Cron jobs / Airflow for ETL pipelines.
• Docker basics and VPS server knowledge.
• Git version control.
• Handling large-scale data storage systems.
• Experience in travel or location-based data systems.
• Strong experience in data scraping.
What You Will Do
• Lead large-scale scraping initiatives focused on deep travel and tourism datasets.
• Collect detailed information about travel destinations.
• Extract structured and unstructured travel data from multiple global sources.
• Clean, normalize, and refine raw datasets into high-quality AI-ready formats.
• Build automated scraping and ETL pipelines to continuously expand and refresh travel datasets.
• Handle geospatial data (latitude/longitude, clustering, distance calculations).
• Implement anti-bot handling mechanisms and proxy rotation systems.
• Perform feature extraction, sentiment tagging, and category clustering.
• Prepare optimized datasets specifically for AI model training.
• Continuously improve data depth, coverage, and accuracy.
• Collaborate closely with AI/ML engineers to enhance model performance.