I help teams design and maintain modern data platforms that are scalable, reliable, and easy to operate. With hands-on experience in Snowflake, Iceberg, Airflow, and Databricks, I enjoy solving complex data problems and turning them into simple, production-ready solutions.

Karan Vijay Singh

Available to hire

Experience Level

Expert

Language

English
Fluent

Work Experience

Data Engineer at DoorDash
December 31, 2023 - Present
Developed an enhanced Snowflake target load operator enabling Iceberg table management; migrated existing Snowflake-native jobs to Iceberg-managed tables; built migration workflows with Airflow DAGs and SQL; enabled governance-driven analytics and improved data quality.
Senior Data Developer at Lightspeed HQ
June 1, 2020 - October 1, 2023
Optimized queries over hundreds of GB of data to build persistent derived tables (PDTs) in Looker, reducing runtime and cost by 50%; configured alerts for critical events and improved dashboard reliability.
Data Engineer at Ross Intelligence
March 1, 2020 - April 1, 2020
Enhanced an Apache Beam pipeline that processes thousands of XML files to include sort order in the final schema; reduced runtime from 6 hours to 30 minutes.
Data Scientist (Intern) at Amazon
September 1, 2019 - December 1, 2019
Processed and analyzed clickstream and orders data at scale using AWS services (S3, Redshift, EMR, EC2), PySpark, NumPy, Pandas, and Python; built scalable data workflows.
Software Developer (Data) at Ubilab
May 1, 2019 - August 1, 2019
Collected air quality data from IoT sensors in Mongolia for UNICEF; ingested into Elasticsearch via Logstash; visualized in Kibana.
Data Scientist (Co-op) at Scotiabank
January 1, 2019 - April 1, 2019
Forecasted trends in chequing account balances over a 30-day window using ARIMA and LSTM models on detrended time series; achieved a MAPE of 4.97%; developed data features using XGBoost.
Product Development Engineer at MFS (Mahindra Comviva)
July 31, 2015 - June 30, 2017
Built an automated Jenkins-based solution for creating product builds; supported 150+ million subscribers; analyzed large structured/unstructured datasets; contributed to research and development.

Education

MMath in Computer Science - Data Science Specialization at University of Waterloo
September 1, 2017 - August 1, 2019
B.E. in Computer Science & Engineering at Thapar University, Patiala, India
July 1, 2011 - July 1, 2015

Qualifications

Certified Microsoft Technology Associate: Software Development Fundamentals
June 1, 2014 - January 29, 2026
Graduate Research Scholarship
January 1, 2019 - April 1, 2019
Millennium Graduate Bursary
January 1, 2018 - April 1, 2018
Technical Reviewer: Machine Learning in Production (Book, 2023)
January 1, 2023 - December 31, 2023

Industry Experience

Software & Internet, Media & Entertainment, Professional Services, Financial Services, Education