Open to Opportunities

Sandeep Sah

> |
Bengaluru, Karnataka, India

Architecting enterprise data platforms processing 5TB+ daily for Fortune 500 clients. Specializing in Azure, Snowflake, and modern ETL/ELT pipelines with 99.8% reliability and 90% ETL time reduction.

0TB+
Data Processed Daily
Enterprise-scale pipelines
0%
Load Time Reduction
ETL optimized from 6h to 2h
0%
Pipeline Reliability
With RBAC governance
0%
Query Time Cut
SSAS Tabular + DAX tuning

Technical Expertise

Cloud & Data Platforms

Azure Data FactoryADLS Gen2Azure Synapse AnalyticsSnowflakeSQL ServerDynamics 365Hive

Data Engineering & BI

ETL / ELT PipelinesSSISSSAS TabularPower BIDBTKimball MethodologyAdvanced DAX

Data Modeling

Star SchemaSnowflake SchemaSCD Type 2Dimensional ModelingFact & Dimension TablesData Warehousing

Programming & Query

Python (Pandas / NumPy)T-SQLSQLDAXJavaTensorFlow / CNN

DevOps & Collaboration

Azure DevOps (CI/CD)Git / GitHubAgile / ScrumJIRARBAC & Governance

Analytics & Reporting

Power BI DashboardsSQL Agent MonitoringData Validation FrameworksSLA ComplianceProactive Alerting

Professional Journey

Data Engineer

LTIMindtree
Scania CV AB Sweden
Full-time
Sep 2022 – July 2025
  • Architected Azure Data Factory pipelines integrating multi-source data from Dynamics 365, Hive, and SQL Server into Snowflake.
  • Reduced ETL load times by 90% (from 6h to 2h), processing 5M+ daily records across enterprise data pipelines.
  • Optimized 25+ Power BI dashboards using SSAS Tabular and advanced DAX, exceeding SLAs for 500+ users and cutting query time by 45%.
  • Achieved 99.8% pipeline reliability and implemented granular Role-Based Access Control (RBAC) for data governance and security.
  • Built SQL Agent/Job monitoring with proactive alerting, reducing failure detection by 50% and preventing 95% of SLA breaches.
  • Created a data validation framework reducing manual verification by 90% and improving accuracy from 94% to 99.7%.
  • Managed architectures supporting $50M+ revenue operations for Fortune 500 clients.
Click to collapse

Graduate Engineering Trainee

LTIMindtree
Training
May 2022 – Sep 2022
  • Completed 16-week intensive training in full-stack and data tools: Java, SQL, Python, Azure, and BI.
  • Delivered capstone project: 'Supermarket Analytics Dashboard' — built on 5 fact tables, 12 dimensions, SCD Type 2 implementation, and 15+ Power BI dashboards.
  • Gained hands-on expertise in Azure Data Factory, Synapse Analytics, Power BI, and database design.
Click to expand details

Featured Projects

📄 IEEE Research Publication

Handwritten Character Recognition

Developed a CNN model for Tirhuta language digit recognition achieving 88–89% accuracy. Published in IEEE, contributing to the preservation and digitization of low-resource languages.

PythonCNNDeep LearningTensorFlowMachine Learning
Read on IEEE Xplore
Data Warehouse & BI
Capstone

Supermarket Analytics Dashboard

Capstone project built during LTIMindtree training — designed with Kimball Methodology using 5 fact tables, 12 dimension tables, and SCD Type 2 implementation. Delivered 15+ interactive Power BI dashboards for retail analytics insights.

Power BIKimballSCD Type 2Star SchemaSQL
Automation

Advertisement Reminder System

Automated advertisement scheduling and reminders for Janakpur Today Media Group using Python. Streamlined workflows saving 20+ hours weekly.

PythonAutomationScheduling
Full Stack

Entertainment Tracker

Personal media tracking system for movies, books, and games with advanced filtering and search capabilities.

ReactNode.jsMongoDB
Database Engineering

Hostel Allocation System

Streamlined room assignment logic with automated allocation algorithms and constraint-based matching.

SQLDatabase DesignBackend
Healthcare IT

Blood Donation Management

Optimized donor tracking and blood availability system for hospitals, improving coordination and response times.

DatabaseHealthcare ITCRUD
IEEE Publication

Advancing Tirhuta Digit Recognition

“An Empirical Comparison of Handwritten Character Recognition Using Machine Learning”

Addressed the lack of digital resources for the Tirhuta script by developing a Convolutional Neural Network (CNN) capable of recognizing handwritten digits with high precision. This research contributes to the preservation and digitization of low-resource languages, bridging the gap between ancient scripts and modern technology.

Read Paper on IEEE Xplore

Key Achievements

Model Architecture
Custom CNN
Accuracy Achieved
88–89%
Script
Tirhuta (Maithili)
Impact
Low-resource Language Digitization
Model Accuracy88–89%

Education & Certifications

Academic Journey

B.E. in Computer Science

CMR Institute of Technology, Bengaluru

2018 – 2022

Focused on Data Structures, Algorithms, Database Management Systems, and Software Engineering. Published IEEE research paper on Tirhuta character recognition. Graduated with First Class Distinction.

+2 / Intermediate (Science)

Nepal

2016 – 2018

Completed higher secondary education with focus on Science and Mathematics, building the foundation for engineering studies.

SLC (Secondary Level Certificate)

Nepal

Completed 2016

Successfully completed the nationally recognized School Leaving Certificate examination, marking the beginning of the academic journey.

Certifications

Azure Data Engineer Associate (DP-203)
professional
SnowPro Core Associate
professional
Azure Data Fundamentals (DP-900)
foundational
Microsoft Azure Fundamentals (AZ-900)
foundational
SQL Server Development
professional
Generative AI Fundamentals
foundational

Send a Message

Have a project in mind or want to discuss data engineering opportunities? Drop me a message.