Sandeep Sah
Architecting enterprise data platforms processing 5TB+ daily for Fortune 500 clients. Specializing in Azure, Snowflake, and modern ETL/ELT pipelines with 99.8% reliability and 90% ETL time reduction.
Technical Expertise
Cloud & Data Platforms
Data Engineering & BI
Data Modeling
Programming & Query
DevOps & Collaboration
Analytics & Reporting
Professional Journey
Data Engineer
- Architected Azure Data Factory pipelines integrating multi-source data from Dynamics 365, Hive, and SQL Server into Snowflake.
- Reduced ETL load times by 90% (from 6h to 2h), processing 5M+ daily records across enterprise data pipelines.
- Optimized 25+ Power BI dashboards using SSAS Tabular and advanced DAX, exceeding SLAs for 500+ users and cutting query time by 45%.
- Achieved 99.8% pipeline reliability and implemented granular Role-Based Access Control (RBAC) for data governance and security.
- Built SQL Agent/Job monitoring with proactive alerting, reducing failure detection by 50% and preventing 95% of SLA breaches.
- Created a data validation framework reducing manual verification by 90% and improving accuracy from 94% to 99.7%.
- Managed architectures supporting $50M+ revenue operations for Fortune 500 clients.
Graduate Engineering Trainee
- Completed 16-week intensive training in full-stack and data tools: Java, SQL, Python, Azure, and BI.
- Delivered capstone project: 'Supermarket Analytics Dashboard' — built on 5 fact tables, 12 dimensions, SCD Type 2 implementation, and 15+ Power BI dashboards.
- Gained hands-on expertise in Azure Data Factory, Synapse Analytics, Power BI, and database design.
Featured Projects
Handwritten Character Recognition
Developed a CNN model for Tirhuta language digit recognition achieving 88–89% accuracy. Published in IEEE, contributing to the preservation and digitization of low-resource languages.
Supermarket Analytics Dashboard
Capstone project built during LTIMindtree training — designed with Kimball Methodology using 5 fact tables, 12 dimension tables, and SCD Type 2 implementation. Delivered 15+ interactive Power BI dashboards for retail analytics insights.
Advertisement Reminder System
Automated advertisement scheduling and reminders for Janakpur Today Media Group using Python. Streamlined workflows saving 20+ hours weekly.
Entertainment Tracker
Personal media tracking system for movies, books, and games with advanced filtering and search capabilities.
Hostel Allocation System
Streamlined room assignment logic with automated allocation algorithms and constraint-based matching.
Blood Donation Management
Optimized donor tracking and blood availability system for hospitals, improving coordination and response times.
Advancing Tirhuta Digit Recognition
“An Empirical Comparison of Handwritten Character Recognition Using Machine Learning”
Addressed the lack of digital resources for the Tirhuta script by developing a Convolutional Neural Network (CNN) capable of recognizing handwritten digits with high precision. This research contributes to the preservation and digitization of low-resource languages, bridging the gap between ancient scripts and modern technology.
Read Paper on IEEE Xplore→Key Achievements
Education & Certifications
Academic Journey
B.E. in Computer Science
CMR Institute of Technology, Bengaluru
Focused on Data Structures, Algorithms, Database Management Systems, and Software Engineering. Published IEEE research paper on Tirhuta character recognition. Graduated with First Class Distinction.
+2 / Intermediate (Science)
Nepal
Completed higher secondary education with focus on Science and Mathematics, building the foundation for engineering studies.
SLC (Secondary Level Certificate)
Nepal
Successfully completed the nationally recognized School Leaving Certificate examination, marking the beginning of the academic journey.
Certifications
Send a Message
Have a project in mind or want to discuss data engineering opportunities? Drop me a message.