I am a versatile data engineer with two years of experience specializing in Python, SQL, PySpark, and data warehousing solutions. I have a strong background in designing and building efficient data pipelines using tools like Airflow, DBT, and cloud technologies such as AWS and GCP. My expertise includes migrating data warehouses to Snowflake, which significantly improved query performance and reduced costs. I am passionate about leveraging cloud platforms and modern orchestration tools to optimize data processing workflows. I have hands-on experience in maintaining ETL pipelines, implementing data quality checks, and building APIs to power machine learning models. Continuously learning and adapting, I seek to contribute effectively in fast-paced technical environments and thrive on solving complex data problems.

Karpagapriya Dhanraj

I am a versatile data engineer with two years of experience specializing in Python, SQL, PySpark, and data warehousing solutions. I have a strong background in designing and building efficient data pipelines using tools like Airflow, DBT, and cloud technologies such as AWS and GCP. My expertise includes migrating data warehouses to Snowflake, which significantly improved query performance and reduced costs. I am passionate about leveraging cloud platforms and modern orchestration tools to optimize data processing workflows. I have hands-on experience in maintaining ETL pipelines, implementing data quality checks, and building APIs to power machine learning models. Continuously learning and adapting, I seek to contribute effectively in fast-paced technical environments and thrive on solving complex data problems.

Available to hire

I am a versatile data engineer with two years of experience specializing in Python, SQL, PySpark, and data warehousing solutions. I have a strong background in designing and building efficient data pipelines using tools like Airflow, DBT, and cloud technologies such as AWS and GCP. My expertise includes migrating data warehouses to Snowflake, which significantly improved query performance and reduced costs.

I am passionate about leveraging cloud platforms and modern orchestration tools to optimize data processing workflows. I have hands-on experience in maintaining ETL pipelines, implementing data quality checks, and building APIs to power machine learning models. Continuously learning and adapting, I seek to contribute effectively in fast-paced technical environments and thrive on solving complex data problems.

See more

Experience Level

Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
See more

Language

French
Intermediate
English
Advanced

Work Experience

Data Engineer at Ledger
December 31, 2024 - August 27, 2025
Developed and maintained ETL pipelines using Python, SQL, and AWS Glue to extract, transform, and load data from APIs, increasing processing efficiency by 15% for 1 TB monthly data. Migrated and optimized data pipelines to AWS S3 and Snowflake, reducing infrastructure costs by 20% and boosting throughput by 25%, handling up to 1 TB daily. Implemented data quality checks and monitoring procedures which reduced data errors by 10%, ensuring data integrity for downstream analytics.
Junior Software Engineer at InFynd
January 1, 2023 - August 27, 2025
Developed data-driven front-end features using Angular to enhance user management and interaction with product data. Managed and maintained AWS instances to ensure availability of data-intensive application features.

Education

Masters in Data Science and Analytics at EPITA : Ecole d'Ingénieurs en Informatique
January 11, 2030 - August 27, 2025
Bachelors in Computer Science Engineering at KPR Institute of Engineering and Technology
January 11, 2030 - August 27, 2025

Qualifications

Introduction to Generative AI – Google Cloud
January 11, 2030 - August 27, 2025
Cloud Computing – AWS
January 11, 2030 - August 27, 2025
Introduction to Data Engineer on Azure - Microsoft
January 11, 2030 - August 27, 2025
Engineer Data for Predictive Modeling with BigQuery ML – Google Cloud
January 11, 2030 - August 27, 2025

Industry Experience

Software & Internet, Financial Services, Healthcare, Education, Travel & Hospitality

Experience Level

Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
See more