Manas Pandya

I'm a data scientist with 4+ years of experience delivering machine learning solutions and building scalable data systems in cloud environments. I excel in Python, SQL, and PySpark, and I enjoy feature engineering, model validation, and real-time inference to drive business impact.

I'm known for improving model performance, reducing data processing time, and building reliable pipelines that support production analytics and decision-making. I collaborate closely with product and engineering teams to translate business requirements into deployable ML solutions with measurable outcomes.

Available to hire

Experience Level

Expert

Language

English
Fluent

Work Experience

Data Scientist / AI Engineer at MapleWave Data Solutions Inc.
January 1, 2025 - Present
Increased prediction accuracy by 22% by building classification and regression models using Python and XGBoost, enabling more precise customer segmentation and targeting strategies.
Processed over 50M records using PySpark transformations, improving feature consistency and reducing data issues, leading to more stable model performance and fewer production failures during retraining cycles.
Reduced data pipeline execution time by 28% by restructuring workflows in AWS Glue and optimizing storage in S3 and Redshift.
Decreased inference latency by 35% by deploying REST-based prediction services using FastAPI, enabling real-time model consumption within operational systems and improving the responsiveness of data-driven applications.
Improved experiment accuracy by 30% using structured validation and A/B testing.
Shortened delivery timelines by 25% by collaborating with product and engineering teams to translate business requirements into deployable ML solutions.
Increased adoption of data
Senior Data Engineer at Cloud Sphere IT Labs, India
January 1, 2021 - December 31, 2023
Processed 100M+ records daily by building distributed ETL pipelines using Apache Spark and Airflow, ensuring reliable data availability for analytics and machine learning workloads across multiple business functions.
Improved query performance by 35% by redesigning data models and optimizing queries in AWS Redshift, enabling faster reporting, dashboard refresh cycles, and data access for analytics teams.
Reduced data preparation time by 30% by delivering structured, feature-ready datasets, allowing faster model development and improving efficiency across data science workflows.
Enabled near real-time analytics by integrating streaming and batch data using Kafka and API ingestion, reducing latency and supporting operational monitoring and time-sensitive reporting requirements.
Decreased data inconsistencies by 25% by implementing validation checks and anomaly detection within pipelines, improving trust in data used for reporting and predictive modeling.
Reduced manual effort by 40% by a

Education

Post-Graduate Certificate in Information Technology Solutions at Humber Polytechnic
August 1, 2025 - April 20, 2026
B.Tech in Computer Engineering at Bharti Vidyapeeth Deemed University, India
January 11, 2030 - July 1, 2023

Industry Experience

Software & Internet, Professional Services, Education
