Data Scientist and AI/Machine Learning Engineer with strong foundations in data structures, algorithms, and production ML systems. Skilled in Python, Docker, Kubernetes, and Git-based workflows, with hands-on experience building real-time fraud detection models and transformer-based NLP modules. Looking for a challenging role in data science or AI/Machine Learning engineering focused on NLP, intelligent systems, and real-world AI deployment.

REZA AZARI AGHOUIEH

Data Scientist and AI/Machine Learning Engineer with strong foundations in data structures, algorithms, and production ML systems. Skilled in Python, Docker, Kubernetes, and Git-based workflows, with hands-on experience building real-time fraud detection models and transformer-based NLP modules. Looking for a challenging role in data science or AI/Machine Learning engineering focused on NLP, intelligent systems, and real-world AI deployment.

Available to hire

Data Scientist and AI/Machine Learning Engineer with strong foundations in data structures, algorithms, and production ML systems. Skilled in Python, Docker, Kubernetes, and Git-based workflows, with hands-on experience building real-time fraud detection models and transformer-based NLP modules. Looking for a challenging role in data science or AI/Machine Learning engineering focused on NLP, intelligent systems, and real-world AI deployment.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Beginner
Beginner
See more

Language

English
Advanced
Italian
Beginner
Turkish
Advanced
Persian
Fluent
Azerbaijani
Fluent

Work Experience

Data Scientist Intern / NLP Researcher at Urban/ECO Research Center
March 10, 2025 - September 30, 2025
- Contributed to development of a Hybrid Conversational AI Agent using FANTASIA framework, integrating Graph Database with LLMs, to enable explainable, semantically rich research retrieval. - Developed a Semantic Search Pipeline in Python integrating ModernBERT embeddings with Neo4j GDS algorithm, increasing similarity accuracy (10%+) while reducing retrieval latency (20%+). - Conducted embedding analysis (Pytorch/TensorFlow) and benchmarking transformer models (ModernBERT, mpnet, MiniLM), validating performance with clustering metrics (Silhouette, ARI, V-measure) to optimize semantic vector representation. - Applied Topic Modeling workflows utilizing TurfTopic (BERTopic, Vec2Top, SmolLM2), automating the generation of interpretable topic names and interactive UMAP visualizations.
Data Scientist / Remote at PRODGY InfoTech
September 1, 2024 - October 10, 2024
- Developed predictive models for customer behavior using Python (Scikit-learn); applied outlier detection, preprocessing, and cross-validation across Decision Trees, KNN/SVM algorithms (90.9% accuracy). - Performed EDA on multi-source datasets and traffic accident records to identify high-risk hotspots/trends using Python (Pandas, NumPy, Seaborn). Provided actionable road safety insights via PowerBI dashboards. - Conducted sentiment analysis on 100K+ social media posts to uncover public opinion trends, using Python (NLTK, spaCy, Matplotlib), and WordCloud for text preprocessing and distribution visualization.
IT Manager And Data Analyst at Ravin Tech Company
March 3, 2017 - November 30, 2023
- Executed domains-specific data analytics projects (finance, retail, supply chain). Designed PowerBI/Tableau dashboards using DAX/Power Query. Delivered efficient performance tracking and business insights. - Applied machine learning techniques using python to build predictive analytics models (Random Forest, XGBoost) and support data-driven decisions. - Designed 20+ websites and digital marketing solutions using WordPress/full-stack (PHP, Python, JS, HTML/CSS) and social media. Improved SEO performance to increase web engagement and conversion rates by 25%+ across platforms. - Managed IT infrastructure and systems operations, including Windows Server, Linux, virtualization (VMware, Hyper-V, Citrix), and backup solutions; ensured system reliability (99.9% system uptime, 100% data integrity).

Education

Master Of Data Science at University Of Naples Federico II
September 1, 2023 - March 31, 2026
BSC in Information Technology Engineering at Payame Noor University
September 1, 2006 - September 30, 2016

Qualifications

Programming: Python (PyTorch, TensorFlow, sklearn, Keras, Pandas, NumPy, matplotlib, Streamlit, NLTK)
September 1, 2024 - October 1, 2024
Programming: Python (PyTorch, TensorFlow, sklearn, Keras, Pandas, NumPy, matplotlib, Streamlit, NLP) – Extended
March 1, 2025 - September 1, 2025
Programming with Python and ML libraries (PyTorch, TensorFlow, sklearn, Keras)
September 1, 2024 - October 1, 2024
Graph Data Science and ML pipeline fundamentals
November 1, 2024 - July 1, 2025
BSc
September 1, 2023 - November 26, 2025
JPMorgan Chase Quantitative Research Virtual Experience
January 1, 2025 - June 3, 2026
Docker & Kubernetes Specialization
January 1, 2025 - June 3, 2026
CI/CD Specialization & Data Science Specialization
January 1, 2025 - June 3, 2026
Big Data with Spark & Hadoop
January 1, 2025 - June 3, 2026
Data Modeling in Power BI
January 1, 2025 - June 3, 2026
AWS Cloud Practitioner Essentials
January 1, 2025 - June 3, 2026
SQL for Data Science & Python Programming
January 1, 2024 - June 3, 2026

Industry Experience

Software & Internet, Professional Services, Media & Entertainment, Other, Education, Computers & Electronics, Financial Services
    RAG Pipeline and Optimization

    Investigated PPO/DPO/GRPO for alignment in retrieval-augmented generation; benchmarked performance using Ragas metrics for faithfulness and relevancy.

    Natural Gas Storage Pricing Model

    Modeled using Ornstein-Uhlenbeck mean-reversion pro cesses and Monte Carlo simulations.

    Credit Risk Calibration

    Applied Isotonic Regression and Platt Scaling for risk bucket calibration; evaluated via ROC-AUC and Kolmogorov-Smirnov statistics.

    Open-Ended Metrics for LLMs Evaluation

    Developed taxonomy of 50+ LLMs evaluation metrics; provided task-specific recommendations and decision framework.

    Multimodal Emotion Recognition using Bayesian Methods & RL

    Built a CNN-based MER pipeline with ECG/GSR and BRR/GPR models for valence–arousal prediction (MAE ≈ 0.04, RMSE ≈ 0.057); achieved personalized emotion-aware robotic adaptation via Q-learning (r = 0.7185).

    Pedestrian Detection, Image Processing

    Used and fine-tune YOLOv5 and MobileNetV2 pretrained models for pedestrian detection, achieving nearly 85% model accuracy on CityPerson dataset.

    Cryptocurrency Clustering and Analysis (Time Series Analysis)

    Used Apache Kafka and Spark with KNN and K-means algorithms for real-time cryptocurrency behavior analysis, applying Silhouette and Elbow methods to evaluate clustering performance.

    Student Performance Prediction (Time Series Analysis)

    Built a hybrid LSTM + MHSA + ANN model on the OULAD dataset with significant accuracy by 80%.