Vikas Kumar Singh
Data Architect • Principal AI Engineer • Databricks GenAI Expert
Bangalore, India (Open to Relocation)
📧 singhvks@outlook.in 🔗 LinkedIn · Portfolio
Legal & Work Authorization: EU HSM–eligible (Netherlands), UAE employment visa support feasible
Profile
Principal Data Architect and AI Engineering Manager with 9+ years of experience designing and delivering enterprise-scale data platforms and AI systems. Deep specialization in the Databricks Lakehouse ecosystem, including Unity Catalog governance, Mosaic AI, and production-grade GenAI systems.
Experienced in building and operating platforms used by large business teams under strict regulatory and data privacy constraints (GDPR, HIPAA, FTC). Work spans high-scale architectures, long-lived data products, and globally distributed teams.
Core Expertise
Data & Platform Architecture
- Databricks Lakehouse (Medallion Architecture)
- Unity Catalog, Delta Lake
- Enterprise Data Modeling
Generative AI & Applied ML
- Mosaic AI
- Multi-Agent Systems (LangChain, LangGraph)
- Vector Databases
- RAG Architecture & Cost Optimization
Engineering & MLOps
- Python (FastAPI, Async)
- PySpark, Advanced SQL
- Docker, MLflow, Langflow
- CI/CD and Model Lifecycle Management
Cloud Platforms
- AWS (Glue, Redshift), Azure OpenAI, GCP Airflow
Selected Architecture Case Studies
Production systems designed under real-world scale, governance, and cost constraints
👉 Read more detailed case studies
Principal Architect - Multi-Agent GenAI Analytics Platform (eCommerce)
Designed and delivered a high-concurrency (>1,000 users) GenAI platform for executive analytics, compliance reporting, and decision support, replacing fragmented manual workflows.
- FastAPI- and Docker-based multi-agent service architecture
- Unified AI endpoint consumed by Tableau, Excel, and React frontends
- Multi-layer caching, RAG strategies, and prompt governance for cost and latency control
- Enterprise SSO–based access control and auditability
Outcome
- Automated 60+ recurring analytical reports
- Eliminated manual review cycles, delivering $100K+ annual cost savings
Lead Architect - Geospatial ML Site Selection & Sales Forecasting (Retail)
Architected a Databricks Lakehouse–based geospatial ML platform to support new-store site selection for a large-scale convenience retail network.
- Medallion architecture processing >1TB of transactional, mobility, census, and infrastructure data
- Spatial feature engineering using drive-time isochrones and radial trade areas
- Cold-start forecasting via clustering-based statistical twin modeling
- MLflow- and Unity Catalog–driven governance, lineage, and model lifecycle management
Outcome
- Standardized quantitative benchmarking for new-store feasibility
- Reduced site approval decision cycles by ~70%
- Enabled explainable, defensible forecasts for CAPEX decisions
Professional Experience
Tredence Analytics - Data Science Manager
Bangalore, India • Nov 2024 – Present
- Lead architect for enterprise AI and data platform initiatives
- Managed and mentored a cross-functional team of engineers and data scientists
- Led migration from legacy AWS pipelines to Databricks Lakehouse
- Implemented cost governance and workload optimization to control DBU spend
ZS Associates - Business Technology Solutions Consultant
Pune, India • Apr 2021 – May 2024
- Designed AWS Glue ETL pipelines integrated with Bayesian (MCMC) forecasting models for clinical supply planning
- Implemented MLflow-based model monitoring and production serving via SAP-IBP
- Migrated EDLS pipelines with data quality checks and automated alerting
Collabera Technologies - Data Engineer
Pune, India • Sep 2020 – Apr 2021
- Built Spark-based pipelines for large-scale healthcare datasets
- Developed unified Real-World Data (RWD) models for analytics
- Implemented PII masking and access controls for HIPAA compliance
LTI Mindtree - Engineer
Pune, India • Sep 2016 – May 2020
- Developed API-integrated ML systems for insurance policy recommendations
- Built automated high-throughput API testing frameworks
- Implemented on-prem ETL pipelines supporting GDPR and solvency compliance
Certifications
- Databricks Certified GenAI Engineer (2025)
- Certified NLP Developer - Vskills
- Certified Python Developer - Vskills
- Master’s in Applied Data Science I & II - WorldQuant University
- Math for Machine Learning - Amazon
Education
- B.Tech, Electronics Engineering - BVDU College of Engineering, Pune (2016)
- Diploma, Network Security - BVDU College of Engineering, Pune (2015)
- Secondary Education - Army School, Allahabad (2011)
Languages
- English - Full Professional Proficiency (C2)
Currently open to senior individual contributor or principal-level roles in EU and Middle East technology teams.