I'm Nithin Reddy Desani, a Senior Data Engineer with over 10 years of experience designing, building, and optimizing large-scale data ecosystems across AWS, Azure, and Snowflake. I specialize in architecting cloud-native data platforms, ETL/ELT pipelines, and ML workflows to power analytics, AI, and revenue-generating platforms across life sciences, energy, healthcare, and technology. My toolkit includes Databricks, Airflow, DBT, and Python to deliver end-to-end data solutions that improve data accessibility, scalability, and governance, often delivering up to 75% efficiency gains and multi-million-dollar revenue growth. I collaborate cross-functionally to translate complex data challenges into actionable business insights. I have a strong background in data modeling (Star Schema, Data Vault 2.0), MLOps, and LLM-based solutions, and I have led initiatives on data governance and quality. I’ve built self-hosted model services with fine-tuned LLMs, integrated human feedback, and shipped ML and analytics solutions that accelerate decision-making and business impact at scale.

Nithin Reddy Desani

I'm Nithin Reddy Desani, a Senior Data Engineer with over 10 years of experience designing, building, and optimizing large-scale data ecosystems across AWS, Azure, and Snowflake. I specialize in architecting cloud-native data platforms, ETL/ELT pipelines, and ML workflows to power analytics, AI, and revenue-generating platforms across life sciences, energy, healthcare, and technology. My toolkit includes Databricks, Airflow, DBT, and Python to deliver end-to-end data solutions that improve data accessibility, scalability, and governance, often delivering up to 75% efficiency gains and multi-million-dollar revenue growth. I collaborate cross-functionally to translate complex data challenges into actionable business insights. I have a strong background in data modeling (Star Schema, Data Vault 2.0), MLOps, and LLM-based solutions, and I have led initiatives on data governance and quality. I’ve built self-hosted model services with fine-tuned LLMs, integrated human feedback, and shipped ML and analytics solutions that accelerate decision-making and business impact at scale.

Available to hire

I’m Nithin Reddy Desani, a Senior Data Engineer with over 10 years of experience designing, building, and optimizing large-scale data ecosystems across AWS, Azure, and Snowflake. I specialize in architecting cloud-native data platforms, ETL/ELT pipelines, and ML workflows to power analytics, AI, and revenue-generating platforms across life sciences, energy, healthcare, and technology. My toolkit includes Databricks, Airflow, DBT, and Python to deliver end-to-end data solutions that improve data accessibility, scalability, and governance, often delivering up to 75% efficiency gains and multi-million-dollar revenue growth.

I collaborate cross-functionally to translate complex data challenges into actionable business insights. I have a strong background in data modeling (Star Schema, Data Vault 2.0), MLOps, and LLM-based solutions, and I have led initiatives on data governance and quality. I’ve built self-hosted model services with fine-tuned LLMs, integrated human feedback, and shipped ML and analytics solutions that accelerate decision-making and business impact at scale.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate

Language

English
Fluent

Work Experience

Senior Data Engineer
October 1, 2025 - October 1, 2025
Led the development of an AI-based mobile app predicting vehicle tire quality with >90% accuracy using LLM, improved scalability by restructuring APIs and separating services. Architected ETL with AWS Glue into Redshift, boosting BI efficiency by 75%. Initiated a project mapping plugin explanations tolerant to MITRE/ATT&ACK techniques via statistical analysis, NLP, deep learning, and LLMs. Built automated ML data pipelines and robust ETL functions. Implemented robust Star Schema and Data Vault 2.0 data models in Azure Synapse Analytics. Used Python in AWS Lambda for healthcare data processing with HIPAA compliance. Built & maintained scalable data pipelines with Airflow, DBT, SQL, Python, and AWS to integrate data into Snowflake powering a $200M lead-generation product. Trained and deployed revenue-prediction ML models using Snowflake Snowpark for 18M+ companies. Delivered 5-7% incremental revenue gains and reduced provision costs via advanced data analysis for ENEL’s contact center.
Data Engineer at Microsoft
August 1, 2022 - August 1, 2022
Led data quality and lineage governance initiatives, achieving ~35% improvement in data accuracy. Designed a modern Azure-based data warehouse using Synapse, developed end-to-end pipelines with Azure Data Factory, and leveraged Databricks for cleansing, enrichment, and aggregation. Implemented a single source of truth data lake across Coca-Cola ecosystem, standardizing reporting and resolving inconsistencies. Built ETL pipelines with Spark and Airflow, performed A/B testing to optimize recommendations for Discovery Plus, and developed an Auto-AI system to train and select forecasting models. Automated weekly ETL processes for a wellness brand and conducted rigorous data validation and QA. Documented cloud data warehouse platforms and demonstrated data modeling using Star and Snowflake schemas in AWS Redshift for healthcare analytics.
Data Engineer at Van Wagenen
May 1, 2019 - May 1, 2019
Developed real-time and batch ETL pipelines to load XML files to staging, relational DB, and HDFS after transformation. Created a database for Ontario government stakeholders to optimize truck/load configurations. Expanded Spark/Hadoop (Scala/Python) data infrastructures on AWS, and supported Data Scientists with IAM, S3, DynamoDB, RDS, and Lambda. Used data modeling techniques (star, snowflake, and data vaults) to enhance storage and reporting. Built pipelines to extract/transform/load data from APIs, RDBMS, and flat files into Snowflake. Created ETL scripts and ensured data integrity for scalable analytics.
BI Engineer at Microsoft
December 1, 2018 - December 1, 2018
Engineered a metadata-driven streaming data framework in Azure for reliable data processing and business logic execution, with enrichment, transformation, and aggregation. Contributed to Azure Anomaly Detector features and built an IoT analytics platform in AWS from ingestion to visualization to reduce downtime. Extracted data from Oracle and built test datasets, developed ETL pipelines in SnapLogic, mentored six new hires, and designed multi-zone ADLS storage with CosmosDB integration. Implemented ADLS-based data warehousing (Azure Synapse, Data Factory) and provided data access layers using Dapper, EF Core, and MongoDB. Optimized ETL pipelines for Discovery Plus and led a multi-tenant data strategy for marketing analytics.
Development Database Administrator at Medtronic
May 1, 2017 - May 1, 2017
Led end-to-end SDLC with a modern tech stack (JavaScript/TypeScript, React, Redux, Gatsby, Node.js, Express, Lambda). Drove data ingestion from IBM DB2 and Oracle into ADLS via Azure Data Factory; implemented CI/CD with Jenkins; enhanced Python-based data pipelines for reliability. Built dashboards with Power BI and QlikView; authored SQL queries for PostgreSQL and integrated PySpark data transformations. Led the Manufacturing Optimization Platform using React and .NET for real-time production monitoring, optimized complex SQL schemas, and delivered dashboards for marketing analytics.

Education

Master of Science in Computer Science at Rivier University
January 1, 2014 - May 1, 2015
Bachelor of Science in Computer Science at Jawaharlal Nehru Technological University
September 1, 2009 - July 1, 2013

Qualifications

AWS Certified Data Engineer – Associate
November 1, 2022 - November 1, 2025
Microsoft Certified: Azure Data Engineer Associate
September 1, 2017 - September 1, 2022

Industry Experience

Life Sciences, Energy & Utilities, Healthcare, Software & Internet, Professional Services