I am a Data Engineer with a strong academic background and hands-on expertise in batch, streaming, and event-driven data pipelines. I specialize in data transformation using Python, building robust data models, monitoring pipelines, and ensuring data quality, from ingestion to visualization. I have hands-on experience with AWS Glue (PySpark), Airflow, Snowflake, Pandas, and S3-based data lakes, delivering scalable solutions that improve business decisions. Collaborative by nature, I work closely with stakeholders to translate requirements into reliable data infrastructure. My track record includes reducing processing times, improving throughput, and maintaining high uptime through testing and automation. I’m passionate about clean code, documentation, and mentoring teammates to raise our data capabilities.

Sandeep Dudhraj

Available to hire

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate

Language

English
Fluent

Work Experience

Data Engineer at BHP
February 1, 2022 - Present
Collaborate with stakeholders to understand data-related technical requirements and support data infrastructure needs, facilitating the processing of 10M+ data records per month. Write unit tests, debug data pipeline issues, and document processes following rigorous testing guidelines to maintain 99.9% uptime. Develop AWS Lambda functions, Spark jobs, and Airflow pipelines, reducing data processing times by 30%, and redesign components to scale throughput by 25%.
Data & Software Engineer at Tekvortex Pvt. Ltd.
November 1, 2017 - February 1, 2020
Augmented 50+ features for cloud migration, developed 20+ data processing stacks in Python/R on EC2, and built supervised models predicting usage for 100+ companies with ~78% accuracy. Strengthened ETL pipelines via AWS Glue, decreasing ingestion time from 8 hours to ~1.3 hours.
Data Scientist - Researcher at Innovation Central Perth
June 1, 2021 - February 1, 2022
Designed and built 10+ analytics dashboards with Plotly; performed ETL on 500k real-world data points; improved model metrics (AIC, BIC, R²) by 7–15% through parameter tuning; applied 20+ ML algorithms to time-series classification, regression, and clustering.

Education

Master of Predictive Analysis (Data Science) at Curtin University, Perth, WA
February 1, 2020 - November 1, 2021
Bachelor’s in Computer Engineering at IOE Central Campus (Tribhuvan University), Nepal
January 1, 2013 - January 1, 2017

Qualifications

AWS Certified Developer – Associate
January 1, 2022 - February 3, 2026
Six Stars (Algorithm) - HackerRank
July 1, 2020 - February 3, 2026

Industry Experience

Software & Internet, Professional Services, Agriculture & Mining, Media & Entertainment