I am a Data Analyst and Machine Learning practitioner skilled in Python, SQL, Power BI, and predictive modelling. I specialize in turning raw data into clear insights, dashboards, and AI-driven solutions. I have worked on multiple end-to-end projects including fraud detection, restaurant clustering, customer analytics, and automation tools. I focus on clean data, accurate analysis, and fast delivery. I’m eager to help businesses make data-driven decisions and build smart AI systems that create real impact.

Roumyajit Sarkar

I am a Data Analyst and Machine Learning practitioner skilled in Python, SQL, Power BI, and predictive modelling. I specialize in turning raw data into clear insights, dashboards, and AI-driven solutions. I have worked on multiple end-to-end projects including fraud detection, restaurant clustering, customer analytics, and automation tools. I focus on clean data, accurate analysis, and fast delivery. I’m eager to help businesses make data-driven decisions and build smart AI systems that create real impact.

Available to hire

I am a Data Analyst and Machine Learning practitioner skilled in Python, SQL, Power BI, and predictive modelling. I specialize in turning raw data into clear insights, dashboards, and AI-driven solutions.

I have worked on multiple end-to-end projects including fraud detection, restaurant clustering, customer analytics, and automation tools. I focus on clean data, accurate analysis, and fast delivery.

I’m eager to help businesses make data-driven decisions and build smart AI systems that create real impact.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
See more

Language

English
Fluent

Work Experience

Data Science Intern at Oasis Infobyte
October 1, 2023 - November 30, 2023
Built a Decision Tree Regressor model to predict car prices, designed data cleaning pipelines, and optimized model performance.
Data Science Intern at Bharat Intern
September 1, 2023 - October 31, 2023
Developed an end-to-end predictive model for Titanic survival analysis using Python, performed exploratory data analysis and feature engineering, and implemented multiple ML algorithms.

Education

Bachelor of Computer Applications at Chandigarh University
January 1, 2022 - February 1, 2025
Master of Science in Data Science at Chandigarh University
January 1, 2025 - December 12, 2025

Qualifications

Data Science Intellipaat
April 1, 2025 - December 12, 2025
Artificial Intelligence Intellipaat
March 1, 2025 - December 12, 2025
Advanced Microsoft Excel Intellipaat
April 1, 2025 - December 12, 2025
Microsoft SQL Intellipaat
April 1, 2025 - December 12, 2025
Python Intellipaat
April 1, 2025 - December 12, 2025
SQL Intellipaat
June 1, 2024 - December 12, 2025

Industry Experience

Software & Internet, Professional Services, Education
    paper Vacation Rental Insights – Listings & Reviews Data Analysis

    Analyzed vacation rental listings and customer reviews to understand pricing patterns, neighborhood demand, host responsiveness, review score differences, and revenue estimates. Performed data cleaning, EDA, statistical summaries, and automated validation using PyTest.
    The project provides insights for owners to improve listing quality and optimize pricing strategies.

    paper Marketing Analytics – Customer Segmentation & Business Insights

    Conducted end-to-end marketing data analysis to uncover audience behavior, customer value patterns, and buying trends. Performed RFM segmentation, clustering, visualization, KPI extraction, and developed insights for improving customer retention and marketing campaigns.
    Delivered analysis-ready datasets, visual summaries, and data-driven business recommendations.

    paper Fraud Detection Model – Large-Scale Data Processing & Classification

    Analyzed a large transactional dataset (6M+ rows) to detect fraudulent activities using advanced machine learning methods. Performed extensive data cleaning, outlier handling, feature selection, and model training using Logistic Regression, Random Forest, and Gradient Boosting.
    Optimized accuracy through hyperparameter tuning and validated workflow using automated test cases for reliability and consistency.

    paper Zomato Restaurant Clustering – Customer Segmentation Using ML

    Performed unsupervised learning on Zomato restaurant data to group similar restaurants based on ratings, reviews, cuisines, and metadata. Conducted exploratory data analysis, scaling, dimensionality reduction, and implemented K-Means and Hierarchical clustering.
    Generated actionable insights that help businesses identify patterns in customer preferences, restaurant groups, and market segments.

    paper Titanic Survival Prediction – End-to-End ML Model

    Developed a predictive model to identify passenger survival probability using supervised machine learning.
    Performed data cleaning, missing value imputation, feature engineering, exploratory data analysis, and trained multiple models such as Logistic Regression, Random Forest, and XGBoost.
    Achieved strong accuracy and built a clean, reproducible ML pipeline with visualization-based insights.