I'm a developer who specializes in Data Science, Machine Learning, and Business Intelligence, passionate about transforming complex data into actionable insights. Based in Dublin, Ireland, I combine analytical thinking with software engineering to build intelligent, data-driven solutions. I have hands-on experience developing AI pipelines, dashboards, and predictive models using tools like Python, Power BI, SQL, PyTorch, and Dash. What sets me apart is my ability to bridge the gap between technical AI development and business-focused analytics, delivering solutions that are not only accurate but practical for decision-makers. šŸ’¼ Employment and Project Experience Data Science Intern — Learnet Academy, Pune January 2024 – September 2024 Built a hybrid recommendation system combining collaborative and content-based filtering. Automated ETL and reporting pipelines using Python and SQL, reducing manual effort by 30%. Improved user engagement by 20% through personalized insights. Machine Learning Intern — I-Gurus Consultancy, Pune January 2023 – May 2023 Developed an OCR system using CNN + CRNN architectures, achieving 92% accuracy. Optimized model performance through augmentation and transfer learning, cutting training time by 25%. Delivered dashboards to visualize KPIs and model performance metrics. Data Analytics Intern — Oasis Infobyte June 2025 – July 2025 Designed interactive Power BI dashboards for segmentation and fraud detection. Built SQL reporting pipelines that improved data accessibility and reduced report generation time by 15%. Academic Research Project (MSc Dissertation) — Technological University Dublin May 2025 – September 2025 Designed a multimodal AI system integrating computer vision and NLP with transformer-based models. Deployed a Hugging Face demo achieving 81.4% accuracy, optimized for speed and explainability.…

Chaitanya Rajesh Deokar

ī˜‚ī˜‚
ī˜‚ī˜‚
ī˜‚ī˜‚
ī˜‚ī˜‚
ī˜‚

I'm a developer who specializes in Data Science, Machine Learning, and Business Intelligence, passionate about transforming complex data into actionable insights. Based in Dublin, Ireland, I combine analytical thinking with software engineering to build intelligent, data-driven solutions. I have hands-on experience developing AI pipelines, dashboards, and predictive models using tools like Python, Power BI, SQL, PyTorch, and Dash. What sets me apart is my ability to bridge the gap between technical AI development and business-focused analytics, delivering solutions that are not only accurate but practical for decision-makers. šŸ’¼ Employment and Project Experience Data Science Intern — Learnet Academy, Pune January 2024 – September 2024 Built a hybrid recommendation system combining collaborative and content-based filtering. Automated ETL and reporting pipelines using Python and SQL, reducing manual effort by 30%. Improved user engagement by 20% through personalized insights. Machine Learning Intern — I-Gurus Consultancy, Pune January 2023 – May 2023 Developed an OCR system using CNN + CRNN architectures, achieving 92% accuracy. Optimized model performance through augmentation and transfer learning, cutting training time by 25%. Delivered dashboards to visualize KPIs and model performance metrics. Data Analytics Intern — Oasis Infobyte June 2025 – July 2025 Designed interactive Power BI dashboards for segmentation and fraud detection. Built SQL reporting pipelines that improved data accessibility and reduced report generation time by 15%. Academic Research Project (MSc Dissertation) — Technological University Dublin May 2025 – September 2025 Designed a multimodal AI system integrating computer vision and NLP with transformer-based models. Deployed a Hugging Face demo achieving 81.4% accuracy, optimized for speed and explainability.…

Available to hire

I’m a developer who specializes in Data Science, Machine Learning, and Business Intelligence, passionate about transforming complex data into actionable insights. Based in Dublin, Ireland, I combine analytical thinking with software engineering to build intelligent, data-driven solutions.
I have hands-on experience developing AI pipelines, dashboards, and predictive models using tools like Python, Power BI, SQL, PyTorch, and Dash. What sets me apart is my ability to bridge the gap between technical AI development and business-focused analytics, delivering solutions that are not only accurate but practical for decision-makers.

šŸ’¼ Employment and Project Experience

Data Science Intern — Learnet Academy, Pune
January 2024 – September 2024
Built a hybrid recommendation system combining collaborative and content-based filtering.
Automated ETL and reporting pipelines using Python and SQL, reducing manual effort by 30%.
Improved user engagement by 20% through personalized insights.

Machine Learning Intern — I-Gurus Consultancy, Pune
January 2023 – May 2023
Developed an OCR system using CNN + CRNN architectures, achieving 92% accuracy.
Optimized model performance through augmentation and transfer learning, cutting training time by 25%.
Delivered dashboards to visualize KPIs and model performance metrics.

Data Analytics Intern — Oasis Infobyte
June 2025 – July 2025
Designed interactive Power BI dashboards for segmentation and fraud detection.
Built SQL reporting pipelines that improved data accessibility and reduced report generation time by 15%.

Academic Research Project (MSc Dissertation) — Technological University Dublin
May 2025 – September 2025
Designed a multimodal AI system integrating computer vision and NLP with transformer-based models.
Deployed a Hugging Face demo achieving 81.4% accuracy, optimized for speed and explainability.

See more

Language

English
Fluent

Work Experience

Data Science Intern at Learnet Academy
January 15, 2024 - September 9, 2024
Engineered hybrid recommendation system (collaborative + content-based filtering) improving user engagement by 20%. Automated attendance tracking using OpenCV/KNN face recognition, reducing manual errors by 30%.
Machine Learning Intern at I-Gurus Consultancy
January 2, 2023 - May 15, 2023
Developed Devanagari OCR system using CNN+CRNN models (92% accuracy). Optimized image preprocessing pipeline, reducing inference time by 40%.
AI/ Data Scientist at TUD
May 5, 2025 - September 30, 2025
Designed and deployed a multimodal AI pipeline integrating computer vision and NLP, achieving 81.4% (Strict) and 75.6% (VQA2) on benchmark datasets. Conducted end-to-end data preprocessing, feature engineering, hyperparameter tuning, and model optimization, reducing inference latency to 392ms/query. Integrated transformers, reinforcement learning (PPO), and memory-augmented retrieval, improving model reasoning and explainability. Released a reproducible Hugging Face demo, showcasing scalable MLOps, deployment, and reproducibility
Data Analytics Intern at Oasis Infobyte
June 2, 2025 - July 7, 2025
Applied statistical modelling, feature engineering, and ML algorithms (logistic regression, random forest) for fraud detection and segmentation on dataset of 50K+ records. Standardized SQL pipelines and data cleaning workflows, improving reporting speed by 15%. Designed Power BI dashboards to visualize predictive modelling outcomes for stakeholders.

Education

MSc Data Science at Technological University Dublin
September 12, 2024 - September 30, 2025
2:1 Honours, Upper Second Class
BE Artificial Intelligence & Data Science at ADYPSOE, Pune, India
June 15, 2020 - June 18, 2024
First Class Honours, CGPA: 8.0/10

Qualifications

Python for Data Science, AI & Development Certificate 1
January 1, 2024 - December 31, 2024
IBM Data Science Professional Certificate
January 1, 2023 - December 31, 2023
Artificial General Intelligence
January 1, 2023 - December 31, 2023
Microsoft Azure AI Fundamentals
January 1, 2025 - October 15, 2025

Industry Experience

Software & Internet, Computers & Electronics, Retail, Other, Education
    paper Multimodal AI System for Visual Question Answering

    Developed and deployed an AI pipeline integrating computer vision and natural language processing for a Visual Question Answering (VQA) task.
    The system used transformer-based models and reinforcement learning (PPO) to improve reasoning and response accuracy.
    Optimized model latency to 392ms/query and deployed the solution as a reproducible Hugging Face demo with an interactive dashboard interface.

    🧩 Tech Stack:
    Python Ā· PyTorch Ā· Hugging Face Ā· Flask Ā· MAG Ā· Reinforcement Learning(PPO)

    šŸŽÆ Outcomes:

    Achieved 81.4% (Strict) and 75.6% (VQA2) accuracy on benchmark datasets
    Released production-ready open-source demo

    paper ChurnChampion: Churn Prediction Modeling

    ChurnChampion is a predictive analytics project designed to forecast customer churn and help businesses improve retention strategies. The project includes data preprocessing, feature engineering, model training, and interactive visualization through a Dash web app.
    Machine learning models such as Logistic Regression, Random Forest, and XGBoost were used to predict churn probability, while the dashboard allows users to visualize model performance, feature importance, and customer behavior trends.

    Key Features:
    End-to-end machine learning pipeline for churn prediction
    Interactive Dash web app for exploring churn patterns and model metrics
    Visualizations: Confusion Matrix, ROC Curve, Feature Importance, and Churn Distribution
    Reusable project structure for training, evaluation, and reporting

    Tech Stack:
    Python Ā· Scikit-learn Ā· Dash Ā· Plotly Ā· Pandas Ā· NumPy Ā· Seaborn Ā· Matplotli

    paper OCR System for Document Recognition

    Developed an OCR pipeline using CNN + CRNN architectures, achieving 92% accuracy on a 200K-image dataset. Optimized training using augmentation, transfer learning, and early stopping, reducing model training time by 25%. Deployed for 10K+ active users to automate document processing.

    Tech Stack: Python Ā· TensorFlow Ā· OpenCV Ā· CNN/CRNN

    paper Movie Recommendation System

    Designed a collaborative + content-based recommendation engine for a learning platform, boosting user engagement by 20%. Automated ETL pipelines and reporting workflows, reducing manual data effort by 30% and improving personalization accuracy.
    Tech Stack: Python Ā· Pandas Ā· NumPy Ā· Matplotlib

    Roles: Data Scientist Ā· ML Developer

    paper Stock Market Forecasting Dashboard

    Created time-series forecasting models (ARIMA, LSTM) to predict stock trends using 5 years of historical data. Built an interactive visualization dashboard for investors to explore predictions, trends, and confidence intervals.

    Tech Stack: Python Ā· Power BI Ā· Matplotlib Ā· Pandas

    paper Sentiment Analysis on Customer Reviews

    Developed an NLP pipeline using LSTMs and Transformer models on 50K+ customer reviews to classify sentiment and extract key insights. Built dashboards to visualize sentiment trends and feedback categories for business intelligence. Achieved 85%+ classification accuracy.

    Tech Stack: Python Ā· PyTorch Ā· Hugging Face Ā· Matplotlib

    paper Customer Segmentation & Marketing Insights

    Applied K-Means clustering and PCA on a 30K+ customer dataset to identify marketing segments and improve campaign targeting. Built a Power BI dashboard to visualize clusters and KPIs across demographics, purchasing patterns, and engagement levels.

    Tech Stack: Power BI Ā· Python (Pandas, Scikit-learn) Ā· SQL

    paper Fraud Detection System

    Developed supervised ML models (Logistic Regression, Random Forest) and anomaly detection pipelines for fraud detection using 200K+ transactional records. Performed feature engineering and evaluated performance with confusion matrix, precision, and recall metrics. Visualized fraud patterns and risk scores for business users.

    Tech Stack: Python Ā· Scikit-learn Ā· SQL Ā· Power BI

    paper Predictive Maintenance Dashboard

    Built an end-to-end predictive maintenance solution for IoT sensor data (100K+ readings). Trained Random Forest, Gradient Boosting, and LSTM models to forecast equipment failures and integrated the models with a Flask API for real-time alerts. Designed an interactive Power BI dashboard to visualize sensor trends and failure predictions.
    Tech Stack: Python Ā· Power BI Ā· Flask Ā· SQL

    paper Customer Segmentation and Performance Dashboard (Power BI)

    Built an interactive Power BI dashboard to visualize customer segmentation and key performance metrics using SQL-integrated datasets.
    Implemented data cleaning and transformation pipelines for 50K+ records, improving reporting efficiency by 15%.
    Used DAX, Power Query, and SQL joins for advanced KPIs and dynamic filtering.

    🧩 Tech Stack:
    Power BI Ā· SQL Ā· Excel Ā· Python (Pandas)

    šŸŽÆ Outcomes:
    Delivered real-time analytics dashboard for business reporting
    Enhanced data visibility and decision-making for marketing teams