Hi, I'm Karan Sangha, a data engineer with a passion for building efficient data pipelines and platforms. I specialize in leveraging AWS, Terraform, Airflow, and dbt to automate workflows, optimize performance, and reduce costs. I enjoy modernizing data infrastructure to enable seamless integration, transformation, and analysis of data, providing scalable solutions for complex business challenges. With experience across different industries, I consistently work closely with analysts and data scientists to build data models and optimize query performance. I'm driven by making data more accessible and useful for business teams to generate faster insights and achieve impactful results.

Karan Sangha

Hi, I'm Karan Sangha, a data engineer with a passion for building efficient data pipelines and platforms. I specialize in leveraging AWS, Terraform, Airflow, and dbt to automate workflows, optimize performance, and reduce costs. I enjoy modernizing data infrastructure to enable seamless integration, transformation, and analysis of data, providing scalable solutions for complex business challenges. With experience across different industries, I consistently work closely with analysts and data scientists to build data models and optimize query performance. I'm driven by making data more accessible and useful for business teams to generate faster insights and achieve impactful results.

Available to hire

Hi, I’m Karan Sangha, a data engineer with a passion for building efficient data pipelines and platforms. I specialize in leveraging AWS, Terraform, Airflow, and dbt to automate workflows, optimize performance, and reduce costs. I enjoy modernizing data infrastructure to enable seamless integration, transformation, and analysis of data, providing scalable solutions for complex business challenges.

With experience across different industries, I consistently work closely with analysts and data scientists to build data models and optimize query performance. I’m driven by making data more accessible and useful for business teams to generate faster insights and achieve impactful results.

See more

Experience Level

Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate

Language

Javanese
Intermediate

Work Experience

Data Engineer at StellarAlgo
April 30, 2025 - July 18, 2025
Developed and maintained scalable ELT pipelines using SQL and dbt within AWS Redshift, improving query performance and reducing data latency for analytics. Developed and deployed Airflow DAGs for automated ELT processes, ensuring efficient and scalable data transformation workflows. Generated over $1 million in annual revenue by launching a self-service platform in Q4 2024, streamlining integration and empowering the sales team. Accelerated onboarding time from 12 weeks to 1 week and increased client satisfaction by architecting and deploying the self-service data ingestion platform. Partnered with analysts and data scientists to build data models and optimize query performance, enabling faster insights for business teams. Reduced data pipeline processing times by 40% and saved over $750k annually in AWS costs by transitioning to a modern data stack using AWS, Python, Data Lakehouse, and dbt. Led adoption of coding conventions with tools like Ruff for Python linting and formatting, imp
Data Scientist at British Columbia Maritime Employers Association
February 1, 2022 - July 18, 2025
Boosted query execution speeds by 2-3x and reduced on-demand compute time by optimizing SQL database performance, including implementing materialized views and transitioning to a star schema. Developed and maintained SQL-based ELT pipelines improving operational analytics. Modernized data ingestion from Excel to Python and Airflow, automating workflows and reducing manual effort. Developed interactive Tableau dashboards for real-time decision-making by executive leadership. Increased ROI on training programs by 17% and saved $200k+ in workforce planning through optimized trainee pipeline management. Improved forecast accuracy by 87% by leading the development of labor demand prediction models. Enhanced team visibility and project tracking by implementing JIRA for sprint planning and task management.
Data Engineer at StellarAlgo
April 30, 2025 - July 18, 2025
Developed and maintained scalable ELT pipelines using SQL and dbt within AWS Redshift, improving query performance and reducing data latency for analytics. Developed and deployed Airflow DAGs for automated ELT processes, ensuring efficient and scalable data transformation workflows. Generated over $1 million in annual revenue by launching a self-service platform integrated with Airflow, unlocking new business opportunities and empowering the sales team. Accelerated onboarding time from 12 weeks to 1 week by architecting and deploying the self-service data ingestion platform, enabling clients to load data from their own systems and map it to predefined models. Partnered with analysts and data scientists to build data models and optimize query performance for faster insights. Reduced data pipeline processing times by 40% and saved over $750k annually in AWS costs by transitioning to a modern data stack. Standardized client deployments using Terraform, Airflow, and dbt. Improved code cons
Data Scientist at British Columbia Maritime Employers Association
February 1, 2022 - July 18, 2025
Boosted query execution speeds by 2-3x and reduced on-demand compute time by optimizing SQL database performance, implementing materialized views, and transitioning to a star schema. Developed and maintained SQL-based ELT pipelines, improving data accessibility and reliability. Modernized data ingestion processes from Excel to Python and Airflow, automating orchestration of data from SFTP and inboxes. Developed interactive Tableau dashboards enabling real-time decision-making for executive leadership, integrated into daily operations. Collaborated with Training department to optimize trainee pipeline, increasing ROI by 17% and saving over $200k in workforce planning. Led development of a labor demand prediction model, improving forecast accuracy by 87% and enabling data-driven simulations for resource allocation. Implemented JIRA for sprint planning and task management, improving team visibility and project tracking.

Education

MSc at University of Colorado Boulder, CO, USA
January 1, 2023 - April 30, 2023
BSc at Simon Fraser University, Burnaby, Canada
January 1, 2019 - April 30, 2019
MSc Data Science at University of Colorado Boulder
January 1, 2023 - April 30, 2023
BSc Data Science at Simon Fraser University
January 1, 2019 - April 30, 2019

Qualifications

Add your qualifications or awards here.

Industry Experience

Software & Internet, Retail, Professional Services, Government

Experience Level

Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate

Hire a Data Scientist

We have the best data scientist experts on Twine. Hire a data scientist in Montréal today.