VIKAS KUMAR SINGH
Professional Summary 🚀
Data Science Manager with 9+ years of expertise in delivering production-grade AI/ML and Generative AI solutions across Retail, Pharmaceutical, and Insurance industries. Proficient in building scalable ML/GenAI systems, managing large-scale data workflows, and integrating predictive analytics into decision platforms. Strong background in experimentation design, statistical modeling, MLOps (AWS, Databricks, MLflow), and leading cross-functional teams to align ML strategy with business goals.
Core Competencies
- Generative AI & LLM Architecture (RAG, Multi-Agent Systems)
- Forecasting (Time Series, Bayesian, Geospatial Analysis)
- Cloud & MLOps Platforms (AWS, Databricks, MLflow, Docker)
- Executive Stakeholder Management & AI Strategy Definition
- Team Leadership & Mentorship (10+ members)
- Scalable Data Engineering (PySpark, SQL, ETL Pipelines)
Professional Experience
Tredence Analytics
Manager – Data Science | Nov 2024 – Present
- Designing the technical roadmap and architecture for end-to-end AI/ML solution delivery, optimizing existing retail planning and store operations.
- Architecting the GenAI integration and platform readiness for Category Performance Analysis.
- Deployed machine learning models that achieved 60% forecasting accuracy for newly proposed sites across multiple sales categories.
- Built and deployed ML pipelines using Databricks and MLflow, enabling automated training, evaluation, and monitoring.
- Collaborated with international business teams across the US to integrate model outcomes into the planning system.
ZS Associates
Business Technology Solutions Consultant | Apr 2021 – Aug 2024
- Delivered commercial analytics and ML solutions for top-tier pharma clients, leveraging AWS-based infrastructure.
- Designed and implemented A/B testing strategies and causal inference workflows for model validation and iteration.
- Owned the end-to-end ML solution lifecycle: data engineering, model building, deployment, monitoring, and retraining.
- Led development of NLP models for personalized HCP outreach (Next Best Action System).
- Directed AI teams of 10+ members to deliver high-ROI ML solutions.
Collabera Technologies
Data Engineer | Sep 2020 – Apr 2021
- Built scalable Spark-based pipelines for ingesting multi-format healthcare and insurance datasets.
- Built a unified data model used for Real World Data assessment.
- Enabled exploratory data analysis through custom-built BI tools using Plotly and Jupyter.
- Reduced overall analysis time by 40% compared with the earlier manual process.
LTI Mindtree
Engineer | Sep 2016 – May 2020
- Developed API-integrated machine learning models to recommend breakdown cover for insurance policies.
- Built real-time monitoring tools and conducted regular model performance reviews to ensure system stability.
- Managed end-to-end AI-driven automation workflows, collaborating with business and QA teams.
Key AI/ML Projects & Solutions 💡
- Retail Analytics Multi-Agent System (GenAI): Designed an LLM-based multi-agent architecture to provide inferential and causal findings for eCommerce sales and marketing analysis.
- GenAI Product Harmonization: Utilized LLM + clustering for attribute extraction to unify retail catalog data, improving data quality and search.
- Bayesian Clinical Trial Forecasting: Developed an MCMC model (using PyMC3) to predict enrolment and attrition rates for clinical trials.
- Store Site Selection & Demand Forecasting: Combined Time Series Forecasting with geospatial analysis (Folium, GeoPandas) to find demand drivers for new store sites.
- Next Best Action System: Implemented NLP models for personalized HCP outreach and recommendation.
Technical Stack ⚙️
Generative AI & LLM Expertise
- Frameworks: Langfuse, LangChain, LangGraph, MCP Server
- Concepts: RAG (Retrieval-Augmented Generation), Vector Databases, LLM Prompt Engineering
Machine Learning & Statistical Modelling
- Algorithms: XGBoost, LightGBM, PyMC3, Statsmodels, Clustering (BIRCH), Scikit-learn
- Domains: Time Series Forecasting, Causal Inference, Bayesian Modeling, NLP
Cloud & MLOps Platforms
- Cloud: AWS (EC2, Redshift, Glue, Lambda), GCP (Vertex AI)
- MLOps: Databricks, MLflow, Docker, Git, Jenkins, Trello, Confluence, Jira
- Big Data: PySpark, Pandas, NumPy, ETL Pipelines, SQL (Advanced)
Programming & Data Visualization
- Programming: Python, SQL, Java
- Visualization: Tableau, Plotly, Seaborn, Folium, GeoPandas
Leadership and Strategic Initiatives 📊
- Directed AI teams of 10+ members to deliver high-ROI ML solutions.
- Partnered with stakeholders to define AI strategy, use case roadmaps, and adoption KPIs for enterprise clients.
- Built scalable, reproducible MLOps pipelines aligned with enterprise architecture.
- Led a large-scale migration of legacy data pipelines to AWS and defined its success metrics.
- Estimated resources and timelines for complex cloud solutions and AI-enabled use cases.
Certifications & Education
Certifications
- Certified GenAI Engineer – Databricks
- Certified NLP Developer – Vskills
- Certified Python Developer – Vskills
- Tableau Certified Author – Tableau
- Masters in Applied Data Science I & II – WorldQuant University
- Math for Machine Learning – Amazon
- Certified Agile – ATA
Education
| Program / Qualification | Institution | Year | Score |
| :--- | :--- | :--- | :--- |
| B.Tech, Electronics Engineering | BVDU College of Engineering, Pune | 2016 | 70% |
| Diploma, Network Security | BVDU College of Engineering, Pune | 2015 | 83% |
| Senior Secondary | Army School, Allahabad | 2011 | 78% |
| Higher Secondary | Army School, Udhampur | 2010 | 89% |
Contact & Online Profiles
- Email: singhvks@outlook.in
- LinkedIn: linkedin.com/in/singhvks
- Phone: +91-9403647912