Available to hire
Hi, I’m Nilay Borsadiya. I’m a data engineer with over 4 years of hands-on experience building robust, scalable data infrastructure across healthcare and logistics. I design and productionize ETL pipelines using PySpark, Python, and cloud platforms to enable timely, reliable insights while prioritizing data quality and governance.
I enjoy turning complex data into actionable insights, partnering with cross-functional teams to deliver dashboards, data lakes, and governance that protect sensitive data and accelerate decision-making.
Skills
Experience Level: Expert (16 skills), Intermediate (3 skills)
Language
English
Fluent
Work Experience
Data Engineer at McKesson
January 1, 2024 - November 17, 2025
Developed and maintained scalable, production-grade ETL pipelines using PySpark and AWS Glue to ingest, transform, and load over 10TB of structured data. Implemented automated reporting with Power BI and Excel VBA, reducing reporting time by 20% and manual errors by 25%. Designed an end-to-end data lake on Amazon S3 integrated with Redshift Spectrum and AWS Athena, accelerating time to insights by 40%. Established automated data quality frameworks using Great Expectations within ETL workflows, improving data accuracy and regulatory compliance for healthcare data. Built modular data ingestion components with Python, AWS Lambda, and SQS to onboard data sources quickly, boosting onboarding speed by 60%. Orchestrated batch and streaming workflows with Apache Airflow and AWS Step Functions, ensuring data availability with strict SLAs and real-time monitoring.
Data Engineer at Metasystems
February 1, 2022 - February 1, 2022
Designed and developed robust batch and real-time data pipelines using Apache Kafka for streaming, Apache Spark for distributed processing, and SQL for transformation to handle high-volume logistics data. Led modernization of on-prem ETL processes by migrating to cloud-native architectures using Azure Data Factory and Azure Synapse Analytics, improving pipeline execution speed by 45%. Modeled enterprise-wide dimensional schemas in Snowflake to support BI reporting across logistics and supply chain operations. Developed reusable Python scripts to standardize structured and unstructured data, ensuring governance compliance and enhancing downstream analytics. Collaborated with DevOps to implement real-time monitoring and alerting using AWS CloudWatch, Azure Monitor, and Prometheus, minimizing downtime. Leveraged Pandas, NumPy, and Scikit-learn for preprocessing and analysis to extract actionable insights.
Data Engineer at Metasystems
January 1, 2020 - February 1, 2022
Designed and developed batch and real-time data pipelines with Kafka, Spark, and SQL to handle high-volume logistics data. Migrated on-prem ETL to Azure Data Factory and Azure Synapse Analytics, accelerating pipeline speed by 45%. Modeled enterprise schemas in Snowflake using star and snowflake models to support BI across logistics and supply chain. Built reusable Python transformations to standardize data formats and enforce governance. Implemented real-time monitoring with AWS CloudWatch, Azure Monitor, and Prometheus to minimize downtime. Leveraged Pandas, NumPy, and Scikit-learn to preprocess large datasets and extract insights shaping data strategies.
Education
Post-Graduation in Artificial Intelligence and Machine Learning at Lambton College, Canada
Bachelor of Technology, Information Technology at DDU, India
Qualifications
Python and SQL for Data Analysis
Data Visualization using Microsoft Power BI and Tableau
Industry Experience
Healthcare, Transportation & Logistics, Software & Internet, Professional Services, Education, Manufacturing