I’m Harshada Sasturkar, a Data Scientist with 3 years of experience in data analysis, predictive modelling, and data visualization. I’m proficient in Python and SQL, and my goal is to drive business growth by implementing data-driven solutions. I recently completed my Master of Science in Data Science at Northeastern University with a 4.00 GPA and hold a BE in Computer Engineering from Pune Institute of Computer Technology. I thrive in cross-functional teams and enjoy turning complex data into actionable insights and measurable business impact.

Harshada Sasturkar

Available to hire

Experience Level

Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate

Language

English
Fluent

Work Experience

Data Science Engineer at Flexday Solutions LLC
May 1, 2025 - Present
Built a demand-driven staffing forecast model using PySpark to predict next-day and future labor needs from outbound workload and order volumes, reducing total labor hours by 32%. Deployed an Incorta dashboard visualizing shift-level forecasts and workload insights, enabling operations teams to make data-driven staffing decisions and track forecast accuracy. Built an analysis engine to predict downtime events from patterns in production line event data, and created a Power BI dashboard presenting key insights to enable proactive operator interventions and improve line efficiency.
Software Engineer - AI at Protos Software LLC
September 1, 2024 - May 1, 2025
Engineered an AI document assistant using LangChain, GPT-4, and the Pinecone vector database to query operating manuals and permits, enabling 50% faster access to critical maintenance information and improving troubleshooting efficiency. Built and deployed text-to-SQL conversational AI agents for an airline database using LangChain, LangGraph, and OpenWebUI, implementing multi-agent workflows to enable natural language querying with 90% accuracy. Developed predictive models using AWS AutoGluon for wastewater treatment plants to estimate 5-day Biological Oxygen Demand (BOD5) on the same day with an R² of 0.8, eliminating the 5-day wait time and supporting environmental compliance. Developed an IoT data ingestion pipeline using AWS SQS, Lambda, ECS, and Terraform to fetch IoT sensor data from external APIs and load it into Losant, enabling real-time equipment monitoring and faster operational decisions.
Data Science Intern at FoundersEdge
June 1, 2024 - September 1, 2024
Addressed the challenge of predicting startup exit valuations from founder attributes by implementing Random Forest and SVM models with statistical feature selection and class imbalance handling, boosting F1 score by 15% and reducing bias in investment decisions. Overcame the scarcity of founder characteristics data by conducting digital surveys, resulting in a 50% increase in acquired data and improved insight into the factors influencing startup outcomes.
Data Science Intern at Flexday Solutions LLC
June 1, 2023 - August 1, 2023
Built an ML framework to automate regression and classification workflows from data analysis to model training, reducing manual effort by 40% and accelerating time to insight. Integrated LightGBM and XGBoost with Optuna hyperparameter optimization and SHAP-based feature selection, achieving 10% higher accuracy than manual approaches.
Associate Software Engineer at Icertis
February 1, 2021 - March 1, 2022
Optimized user data processing in the Icertis Contract Intelligence Platform by collaboratively developing a User Management System in C#, resulting in 15% faster batch processing and improved scalability. Created MySQL scheduled tasks to automatically retrieve and organize JSON-formatted user data into structured tables, streamlining role, group, and permission assignments for users. Customized JavaScript dashboards with tailored KPIs to track contract stages, giving clients real-time visibility and actionable insights and leading to a 20% increase in turnaround efficiency.

Education

Master of Science in Data Science at Northeastern University
September 1, 2022 - May 1, 2024
Bachelor of Engineering in Computer Engineering at Pune Institute of Computer Technology
August 1, 2017 - July 1, 2021

Industry Experience

Software & Internet, Professional Services, Manufacturing, Education, Media & Entertainment