Vikas Kumar Singh
Data Architect • Principal AI Engineer • GenAI Expert
Bangalore, India
Profile
Principal Data Architect and AI Engineering Manager with 10 years of experience designing and delivering enterprise-scale data platforms and AI systems. Deep specialization in the Databricks Lakehouse ecosystem, including Unity Catalog governance, Mosaic AI, and production-grade GenAI systems.
Core Expertise
Data Platforms
- AWS Glue, Redshift
- Databricks Lakehouse
- Unity Catalog, Delta Lake
- Enterprise Data Modeling
Generative AI
- Multi-Agent Systems
- Mosaic AI, Vertex AI
- Azure OpenAI
- RAG Architecture
- Vector Databases
Engineering
- Python (FastAPI, Async)
- PySpark, MLOps
- Docker, Kubernetes
- CI/CD & Model Lifecycle
Selected Case Studies
Principal Architect - Multi-Agent GenAI Platform
Designed and delivered a high-concurrency (>1,000 users) GenAI platform for executive analytics and compliance reporting, delivering $100K+ annual savings.
Read Case StudyLead Architect - Geospatial ML Site Selection
Architected a Databricks Lakehouse–based geospatial ML platform processing >1TB of data, reducing site approval cycles by ~70%.
Read Case StudyProfessional Experience
Tredence Analytics - Data Science Manager
Bangalore, India • Nov 2024 – Present
- Lead architect for enterprise AI and data platform initiatives, managing cross-functional teams.
- Led migration from legacy AWS pipelines to Databricks Lakehouse.
- Implemented cost governance and workload optimization to control DBU spend.
ZS Associates - Business Technology Solutions Consultant
Pune, India • Apr 2021 – May 2024
- Designed AWS Glue ETL pipelines integrated with Bayesian (MCMC) forecasting models.
- Implemented Next Best Action recommendation models interfacing Veeva CRM with MLflow monitoring.
- Migrated EDLS pipelines with automated data quality checks and alerting to AWS Glue.
Collabera Technologies - Data Engineer
Pune, India • Sep 2020 – Apr 2021
- Built Spark-based HIPAA compliant data pipelines for large-scale healthcare datasets.
- Developed unified Real-World Data (RWD) analytics dashboard with zero cost overhead.
- Implemented PII masking and access controls for HIPAA-compliant analytics.
L&T Infotech - Engineer
Pune, India • Sep 2016 – May 2020
- Developed API-integrated ML systems for insurance policy recommendations with A/B testing.
- Built automated high-throughput API testing frameworks.
- Implemented on-prem ETL pipelines supporting GDPR and solvency compliance.
Certifications
- GenAI Solutions Architect (2026)
- Databricks Certified GenAI Engineer (2025)
- AI/ML for Geodata Analysis - ISRO
- Master’s in Applied Data Science - WorldQuant University
- Certified NLP & Python Developer
Education
- B.Tech, Electronics Engineering
BVDU College of Engineering, Pune (2016) - Diploma, Network Security
BVDU College of Engineering (2015)