Hi! I'm Hasnain, a passionate Data Science practitioner based in Karachi, Pakistan. I specialize in statistical analysis, predictive modeling, and developing robust data pipelines. Currently, I'm gaining valuable industry experience through a technical internship at Developer Hub Corporation, where I contribute to machine learning workflow automation and data infrastructure optimization. I enjoy tackling real-world data problems, whether it's building classification models, conducting multivariate analysis, or creating intuitive dashboards for better decision-making. My academic journey in Software Engineering has equipped me with the skills and practical knowledge to excel in data science, and I'm excited about continuing to grow in this dynamic field.

HASNAIN HISSAM

Hi! I'm Hasnain, a passionate Data Science practitioner based in Karachi, Pakistan. I specialize in statistical analysis, predictive modeling, and developing robust data pipelines. Currently, I'm gaining valuable industry experience through a technical internship at Developer Hub Corporation, where I contribute to machine learning workflow automation and data infrastructure optimization. I enjoy tackling real-world data problems, whether it's building classification models, conducting multivariate analysis, or creating intuitive dashboards for better decision-making. My academic journey in Software Engineering has equipped me with the skills and practical knowledge to excel in data science, and I'm excited about continuing to grow in this dynamic field.

Available to hire

Hi! I’m Hasnain, a passionate Data Science practitioner based in Karachi, Pakistan. I specialize in statistical analysis, predictive modeling, and developing robust data pipelines. Currently, I’m gaining valuable industry experience through a technical internship at Developer Hub Corporation, where I contribute to machine learning workflow automation and data infrastructure optimization.

I enjoy tackling real-world data problems, whether it’s building classification models, conducting multivariate analysis, or creating intuitive dashboards for better decision-making. My academic journey in Software Engineering has equipped me with the skills and practical knowledge to excel in data science, and I’m excited about continuing to grow in this dynamic field.

See more

Experience Level

Expert
Expert
Expert
Intermediate
Intermediate
Intermediate

Language

English
Advanced
Urdu
Fluent

Work Experience

Data Science Intern at Developer's Hub Corporation
June 1, 2025 - July 31, 2025
Doing Internship for enhancing skills and gaining experience about data science and analytics.

Education

Bachelor's of Engineering in Software at Mehran University of Engineering and Technology
November 21, 2021 - November 22, 2025
Bachelor of Engineering (B.E) at Mehran University of Engineering and Technology, Jamshoro
November 1, 2021 - December 31, 2024

Qualifications

Data Camp Certified Statistical Analyst
May 1, 2022 - November 15, 2022
Udemy Certified Power Bi Expert
December 1, 2022 - April 30, 2023
Data Camp Certified Machine Learning Expert
May 1, 2023 - August 30, 2023
Data Camp Certified Data Analyst
September 1, 2023 - November 30, 2023
Unsupervised Learning in Python
June 1, 2025 - June 30, 2025
Machine Learning with Tree-Based Models in Python
June 1, 2025 - June 30, 2025

Industry Experience

Software & Internet, Professional Services, Media & Entertainment, Education
    paper Movie Recommender System Using Python
    Just built a Movie Recommender System using Python, Machine Learning & Streamlit — and it's live in action! 🎬🎉 Tired of watching the same old movies? My app helps you discover similar titles based on your favorite ones using: ✅ TF-IDF Vectorization ✅ Cosine Similarity ✅ Content-Based Filtering ✅ And a beautiful interactive dashboard built with Streamlit ✨ Features: 🎯 Type any movie name — get intelligent suggestions 📊 Adjust the number of recommendations dynamically 🌐 Visit the official movie homepage directly 🧠 Powered by real-time text similarity & fuzzy matching 💻 Clean UI with an engaging user experience 🔧 Tools Used: 🐍 Python 🐼 Pandas 🧠 Scikit-learn 🌐 Streamlit 🧮 TF-IDF 📏 Cosine Similarity 🔍 difflib 🎥 Try it out yourself or check the code (DM me if interested!) hashtagMachineLearning hashtagPython hashtagStreamlit hashtagDataScience hashtagAI hashtagMLProject hashtagRecommenderSystem hashtagMovieRecommendation hashtagContentBasedFiltering hashtagLinkedInProjects hashtagTechCommunity hashtagOpenToWork hashtag100DaysOfCode hashtagPortfolioProject hashtagAIForGood hashtagGitHubShowcase hashtagCodeNewbie hashtagDSInternship hashtagDataProjects hashtagPythonForDataScience hashtagArtificialIntelligence hashtagdataengineer hashtagStreamlitDashboard hashtagDataScientist hashtagMachineLearningExpert hashtagMLRecommenderSystem
    paper Movie Recommender System Using Python
    Just built a Movie Recommender System using Python, Machine Learning & Streamlit — and it's live in action! 🎬🎉 Tired of watching the same old movies? My app helps you discover similar titles based on your favorite ones using: ✅ TF-IDF Vectorization ✅ Cosine Similarity ✅ Content-Based Filtering ✅ And a beautiful interactive dashboard built with Streamlit ✨ Features: 🎯 Type any movie name — get intelligent suggestions 📊 Adjust the number of recommendations dynamically 🌐 Visit the official movie homepage directly 🧠 Powered by real-time text similarity & fuzzy matching 💻 Clean UI with an engaging user experience 🔧 Tools Used: 🐍 Python 🐼 Pandas 🧠 Scikit-learn 🌐 Streamlit 🧮 TF-IDF 📏 Cosine Similarity 🔍 difflib 🎥 Try it out yourself or check the code (DM me if interested!) hashtagMachineLearning hashtagPython hashtagStreamlit hashtagDataScience hashtagAI hashtagMLProject hashtagRecommenderSystem hashtagMovieRecommendation hashtagContentBasedFiltering hashtagLinkedInProjects hashtagTechCommunity hashtagOpenToWork hashtag100DaysOfCode hashtagPortfolioProject hashtagAIForGood hashtagGitHubShowcase hashtagCodeNewbie hashtagDSInternship hashtagDataProjects hashtagPythonForDataScience hashtagArtificialIntelligence hashtagdataengineer hashtagStreamlitDashboard hashtagDataScientist hashtagMachineLearningExpert hashtagMLRecommenderSystem
    paper Customer Churn Prediction
    My Customer Churn Prediction Web App! 🛃 I'm excited to share one of my latest end-to-end projects where I built a Customer Churn Prediction App using: ✅ Python ✅ Scikit-learn ✅ Streamlit ✅ Machine Learning models like Logistic Regression, K-Nearest Neighbors, and Random Forest ✅ GridSearchCV for hyperparameter tuning ✅ Joblib for model persistence ✅ Interactive UI for real-time predictions 📊 From data preprocessing, EDA, modeling, and evaluation to deploying a fully interactive Streamlit app, this project was a complete pipeline! 💡 The app predicts whether a customer is likely to churn based on their demographics and banking activity, helping businesses make data-driven retention strategies. 🔥 Key Highlights: Grid search optimized models Clean and intuitive UI Scaled features and saved model for efficient deployment MachineLearning hashtagDataScience hashtagPython hashtagStreamlit hashtagChurnPrediction hashtagCustomerRetention hashtagArtificialIntelligence hashtagML hashtagBankingAnalytics hashtagEndToEndProject hashtagAI hashtagDataVisualization hashtagProjectShowcase hashtagWomenInTech hashtagDeepLearning hashtagRandomForest hashtagKNN hashtagLogisticRegression hashtagLinkedInLearning hashtagGitHubProjects hashtag100DaysOfCode hashtagCodingLife
    paper Stanford Open Policing Project

    Data Analysis of Stanford Open Policing Project
    I conducted a comprehensive data analysis of the Stanford Open Policing Project using Python libraries such as pandas, numpy, matplotlib, seaborn, and scipy.stats. Key aspects of the project included:
    Data Cleaning & Preparation: Removed null values, merged date and time columns for better analysis, and converted data types for accuracy.
    Exploratory Data Analysis (EDA): Visualized driver demographics, drug-related stops, and district-wise accident distributions through bar charts, pie charts, and line plots.
    Gender & Violation Analysis: Investigated the impact of gender on arrests, stop outcomes for speeding violations, and calculated violation counts across different districts.
    Time-Based Trends: Analyzed arrest rates by the hour and annual trends for drug-related stops and search-conducted rates.
    Violation Duration: Mapped and visualized the average stop duration for different violation types.

    paper Breast Cancer Diagnosis Using Logistic Regression

    In this project, I developed a predictive model using logistic regression to classify breast cancer as malignant or benign based on diagnostic data. The dataset was preprocessed by handling missing values, dropping irrelevant columns, and converting categorical data into numerical form for better model performance.
    Key Steps:
    Data Preprocessing: Cleaned the dataset by removing unnecessary columns and handling missing values. Categorical variables were encoded into binary form to aid model understanding.
    Exploratory Data Analysis: Utilized Seaborn and Matplotlib to explore data distributions and relationships, including a correlation matrix heatmap to identify important features.
    Model Training: Applied a Logistic Regression model after scaling the features using StandardScaler to improve convergence. The dataset was split into training and testing sets to evaluate model performance.
    Model Evaluation: Achieved an impressive accuracy of 98.2%. Evaluated the model using a confusion matrix, classification report, and ROC curve analysis, which showed strong predictive performance with an AUC score reflecting excellent discriminative ability.
    Visualization & Insights:
    Generated insightful visualizations including heatmaps and ROC curves that not only supported the model evaluation but also provided clear communication of the findings.
    This project demonstrates my proficiency in data preprocessing, visualization, model building, and evaluation using Python libraries such as pandas, numpy, scikit-learn, seaborn, and matplotlib, showcasing my ability to deliver actionable insights and accurate predictions in healthcare analytics.