Vikas Kumar Singh

Data Architect • Principal AI Engineer • GenAI Expert

Bangalore, India

Profile

Principal Data Architect and AI Engineering Manager with 10 years of experience designing and delivering enterprise-scale data platforms and AI systems. Deep specialization in the Databricks Lakehouse ecosystem, including Unity Catalog governance, Mosaic AI, and production-grade GenAI systems.

Core Expertise

Data Platforms

  • AWS Glue, Redshift
  • Databricks Lakehouse
  • Unity Catalog, Delta Lake
  • Enterprise Data Modeling

Generative AI

  • Multi-Agent Systems
  • Mosaic AI, Vertex AI
  • Azure OpenAI
  • RAG Architecture
  • Vector Databases

Engineering

  • Python (FastAPI, Async)
  • PySpark, MLOps
  • Docker, Kubernetes
  • CI/CD & Model Lifecycle

Selected Case Studies

Principal Architect - Multi-Agent GenAI Platform

Designed and delivered a high-concurrency (>1,000 users) GenAI platform for executive analytics and compliance reporting, delivering $100K+ annual savings.

Read Case Study

Lead Architect - Geospatial ML Site Selection

Architected a Databricks Lakehouse–based geospatial ML platform processing >1TB of data, reducing site approval cycles by ~70%.

Read Case Study
View All Case Studies →

Professional Experience

Tredence Analytics - Data Science Manager

Bangalore, India • Nov 2024 – Present

  • Lead architect for enterprise AI and data platform initiatives, managing cross-functional teams.
  • Led migration from legacy AWS pipelines to Databricks Lakehouse.
  • Implemented cost governance and workload optimization to control DBU spend.

ZS Associates - Business Technology Solutions Consultant

Pune, India • Apr 2021 – May 2024

  • Designed AWS Glue ETL pipelines integrated with Bayesian (MCMC) forecasting models.
  • Implemented Next Best Action recommendation models interfacing Veeva CRM with MLflow monitoring.
  • Migrated EDLS pipelines with automated data quality checks and alerting to AWS Glue.

Collabera Technologies - Data Engineer

Pune, India • Sep 2020 – Apr 2021

  • Built Spark-based HIPAA compliant data pipelines for large-scale healthcare datasets.
  • Developed unified Real-World Data (RWD) analytics dashboard with zero cost overhead.
  • Implemented PII masking and access controls for HIPAA-compliant analytics.

L&T Infotech - Engineer

Pune, India • Sep 2016 – May 2020

  • Developed API-integrated ML systems for insurance policy recommendations with A/B testing.
  • Built automated high-throughput API testing frameworks.
  • Implemented on-prem ETL pipelines supporting GDPR and solvency compliance.

Certifications

  • GenAI Solutions Architect (2026)
  • Databricks Certified GenAI Engineer (2025)
  • AI/ML for Geodata Analysis - ISRO
  • Master’s in Applied Data Science - WorldQuant University
  • Certified NLP & Python Developer

Education

  • B.Tech, Electronics Engineering
    BVDU College of Engineering, Pune (2016)
  • Diploma, Network Security
    BVDU College of Engineering (2015)