Cactus Communications
Website:
cactusglobal.com
Job details:
Overview:
CACTUS is a remote-first organization and we embrace an accelerate from anywhere culture. You may be required to travel to our Mumbai office based on business requirements or for company/team events.
We are looking for a Data Science Lead to spearhead the development of advanced predictive, prescriptive, and generative AI models. In this role, you will mentor a team of specialists to establish end-to-end methodologies—from feature engineering to large-scale model deployment—while driving data-driven solutions for optimization and service enhancement. If you are a research-driven leader committed to Responsible AI, including bias detection and model transparency, this role offers the chance to define the lifecycle standards for impactful AI initiatives.
Responsibilities:
- Lead and mentor a team of Data Scientists and Analysts in developing predictive, prescriptive, and generative AI models
- Establish end-to-end data science methodologies, including feature engineering, model selection, and performance evaluation
- Develop data-driven solutions for optimization, engagement, and service enhancement
- Define data model lifecycle standards, including model versioning, retraining, and explainability frameworks
- Collaborate with AI/ML Engineers to operationalize machine learning models within production systems
- Define and monitor AI performance KPIs such as accuracy, latency, and fairness
- Support responsible AI initiatives through bias detection, fairness validation, and model transparency
- Review and validate deliverables from external AI vendors or partners as part of model governance
Requirements:
- B.Tech / M.Tech / M.S. / Ph.D. in Statistics, Mathematics, Computer Science, or any quantitative discipline
- Advanced certifications in Data Science, Machine Learning, or AI (e.g. AWS ML Specialty) preferred
- Research papers, case studies, or significant open source contributions are preferred
- 8–12 years of total experience in data science and analytics
- 6–8 years in advanced analytics, statistical modelling, and research-driven problem solving
- Minimum 4–6 years leading data science teams or projects involving large-scale model deployment
- Proven track record in implementing AI/ML models in production environments within enterprise setups
Technical Competencies:
- Programming & Tools: Python (pandas, NumPy, scikit-learn, PyTorch, TensorFlow), R, SQL
- Advanced ML Techniques: Deep learning, ensemble methods, time series forecasting, NLP, computer vision, reinforcement learning, and causal inference
- Statistical Modelling: Regression, classification, clustering, Bayesian inference, time-series forecasting
- AI Techniques: Deep learning, NLP (Transformers, BERT, GPT), computer vision, anomaly detection
- Big Data & Analytics: Apache Spark, Databricks, Hadoop, Hive, ELT pipelines
- Visualization: Tableau, Power BI, Matplotlib, Seaborn
- Research & Innovation: Literature review, research methodology, and evaluation of emerging AI technologies
- Model Lifecycle Management: MLflow, Data Version Control (DVC), and experiment tracking frameworks
- Model Evaluation: Advanced metrics, cross-validation strategies, bias detection, fairness evaluation, and model interpretability
- Governance: Responsible AI design, model interpretability, and privacy-preserving AI (DP, FL)
About Cactus:
Established in 2002, CACTUS (cactusglobal.com) is a leading technology company that specializes in expert services and AI-driven products which improve how research gets funded, published, communicated, and discovered. Its flagship brand Editage offers a comprehensive suite of researcher solutions, including expert services and cutting-edge AI products like Mind the Graph, Paperpal, and R Discovery. With offices in Princeton, London, Singapore, Beijing, Shanghai, Seoul, Tokyo, and Mumbai and a global workforce of over 3,000 experts, CACTUS is a pioneer in workplace best practices and has been consistently recognized as a great place to work.
Click on Apply to know more.