Hi, I’m Mirza Klimenta. I hold a PhD in Computer Science from the University of Konstanz and work as a Senior Data Scientist focusing on LLMs/RAGs, Recommender Systems, Knowledge Graphs and Classical ML. I design and deploy production-ready models, including a recommender system powering ARD Audiothek, Germany’s popular audio-on-demand platform. In parallel with research, I contribute to open-source projects, publish on graph drawing and ML topics, and I’m also a writer of literary fiction, with a novel titled Waters of the Green. I enjoy bridging academia and industry to solve real-world problems.

Mirza Klimenta

Hi, I’m Mirza Klimenta. I hold a PhD in Computer Science from the University of Konstanz and work as a Senior Data Scientist focusing on LLMs/RAGs, Recommender Systems, Knowledge Graphs and Classical ML. I design and deploy production-ready models, including a recommender system powering ARD Audiothek, Germany’s popular audio-on-demand platform. In parallel with research, I contribute to open-source projects, publish on graph drawing and ML topics, and I’m also a writer of literary fiction, with a novel titled Waters of the Green. I enjoy bridging academia and industry to solve real-world problems.

Available to hire

Hi, I’m Mirza Klimenta. I hold a PhD in Computer Science from the University of Konstanz and work as a Senior Data Scientist focusing on LLMs/RAGs, Recommender Systems, Knowledge Graphs and Classical ML. I design and deploy production-ready models, including a recommender system powering ARD Audiothek, Germany’s popular audio-on-demand platform.

In parallel with research, I contribute to open-source projects, publish on graph drawing and ML topics, and I’m also a writer of literary fiction, with a novel titled Waters of the Green. I enjoy bridging academia and industry to solve real-world problems.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
See more

Language

Bosnian
Fluent
English
Fluent
German
Fluent

Work Experience

Senior Data Scientist - Freelancer at Freelancer
September 1, 2023 - Present
Leading data science projects with emphasis on large language models, retrieval augmented generation, recommender systems, and knowledge graphs; designing and deploying ML solutions; contributing to ARD Audiothek initiatives and NLP tasks such as entity recognition and redundancy removal.
Senior Data Scientist at Bayerischer Rundfunk/.pub
August 1, 2021 - August 31, 2023
Research on state-of-the-art recommender systems for media content (audio, video, text); implemented a production recommender powering ARD Audiothek; achieved ~15% higher precision; NLP projects including entity recognition and redundancy removal.
Data Scientist at Yewno/Entropy387
June 1, 2020 - July 31, 2021
Research on and implementation of graph-based stock-market prediction models; pricing engine implementation with ~10% MAE improvement over previous model.
Post-Doc at Roma Tre University
June 1, 2018 - May 31, 2019
Research on Graph Morphing Algorithms; implementation of Graph Drawing Algorithms.
Software Engineer at Visteon (former Johnson Controls)
April 1, 2014 - May 31, 2016
(Re)implementation of a testing software; implementation of Finite State Machines; code generation.
Post-Doctoral Researcher at Roma Tre University
June 1, 2018 - May 1, 2019
Research on Graph Morphing Algorithms; implementation of Graph Drawing Algorithms.
Software Engineer at Visteon (formerly Johnson Controls)
April 1, 2014 - May 1, 2016
(Re)implementation of testing software; Finite State Machines; code generation.
Project Lead at PulseSpotter - MediaLab Bayern
May 1, 2024 - October 1, 2024
PulseSpotter is designed to assist journalists in identifying newsworthy topics that are likely to become popular. By gathering information from various news sources and analyzing patterns over time, the system suggests emerging trends using AI and machine learning.
RAG (LLM) Developer at Independent Project
February 1, 2024 - September 1, 2024
Led development of an advanced Retrieval-Augmented Generation (RAG) system for HR data. Used Milvus vector search, OpenAI API; attempted fine-tuning with PEFT (LoRA); technologies included LangChain, LangGraph, Smolagents, LlamaIndex, dspy; deployed via Terraform and GitHub Actions on AWS; initial Streamlit app.
ML Engineer at Client: Spectral Imaging
September 1, 2023 - September 1, 2024
Spectral Image Object Detection: self-supervised learning for classifying sensor readings and identifying anomalies; used one of the YOLO architectures (mmYolo) for fast processing.
ML Engineer at Client: BNPL Platform (Middle East)
April 1, 2021 - August 1, 2021
Fraud detection model; data preprocessing; graph-based clique detection; first ML project for client.
ML Engineer at Client: Sensor Network
January 1, 2019 - August 1, 2019
Sensor Anomaly Detection: baseline ML model to predict readings; enhanced detection with Graph Neural Network.
ML Engineer at US Dispatcher
April 1, 2018 - December 1, 2018
Pricing Engine for a US Dispatcher: improved accuracy by ~40% via feature engineering and two-tier regression (XGBoost).
ML Engineer at StockFink
January 1, 2018 - March 1, 2018
Investment Prediction: graph-based approach predicting investor–company connections using a Graph Neural Network.
ML Engineer at Receeve
September 1, 2019 - April 1, 2020
Debt Collection: behavioral scoring model for loan repayment likelihood; integrated into collection workflow.
ML Engineer at GamingTec (Online Betting)
May 1, 2020 - August 1, 2020
Recommender for Online Betting: improved performance using Factorization Machines over Amazon Personalize HRNN.
ML Engineer at Asian Bank
September 1, 2020 - April 1, 2021
Next-Best Finance-Related Action: predictive models suggesting optimal actions for long-term financial goals.
ML Engineer at Client: Social Network
August 1, 2023 - December 1, 2023
User Grouping in a Social Network: dynamic group formation using vectorization and KNN; recommender system for refinement.
Senior Data Scientist - Freelancer at Self-employed / Freelancer
September 1, 2023 - Present
Senior Data Scientist focusing on LLMs/RAGs, Recommender Systems, Knowledge Graphs and Classical ML; leading projects and delivering production-grade models.
Senior Data Scientist at Bayerischer Rundfunk
August 1, 2021 - August 31, 2023
Research on state-of-the-art developments in Recommender Systems for media (audio, video and text). Implemented a Recommender System powering ARD Audiothek; production model achieved ~15% higher precision than the previous version. Also contributed NLP work: entity recognition and redundancy removal.

Education

Ph.D. at University of Konstanz
November 1, 2009 - December 1, 2012
B.Sc. at The University of Buckingham
October 1, 2005 - June 1, 2009
Ph.D., Computer Science at University of Konstanz
November 1, 2009 - December 1, 2012
B.Sc., Computer Science at The University Sarajevo School of Science and Technology
October 1, 2005 - June 1, 2009
B.Sc., Computer Science at The University of Buckingham
October 1, 2005 - June 1, 2009
Ph.D., Computer Science at University of Konstanz
November 1, 2009 - December 1, 2012
B.Sc., Computer Science at The University Sarajevo School of Science and Technology
October 1, 2005 - June 1, 2009

Qualifications

Personalized Recommendations at Scale
March 1, 2023 - December 18, 2025
Designing State of the Art Recommender Systems
November 1, 2022 - December 18, 2025
Personalized Recommendations at Scale
March 1, 2023 - December 21, 2025
Designing State of the Art Recommender Systems
November 1, 2022 - December 21, 2025
Personalized Recommendations at Scale
March 1, 2023 - January 6, 2026
Designing State of the Art Recommender Systems
November 1, 2022 - January 6, 2026

Industry Experience

Media & Entertainment, Software & Internet, Computers & Electronics, Professional Services, Education
    paper User Grouping in a Social Network

    Objective: To foster communication among like-minded users through dynamic
    group formation based on their responses to specific questions.
    Approach: Employed vectorization and K-Nearest Neighbors (KNN) for initial grouping,
    with a recommendation system for refined group alignment.

    paper Next-Best Finance-Related Action

    Objective: To advise individuals on financial actions that enhance their chances of
    achieving specific goals.
    Context: For an Asian bank aiming to offer actionable advice for long-term financial
    planning, such as securing a home purchase in 20 years.
    Approach: Predictive machine learning models were developed to suggest optimal
    actions (example: obtaining a salary increase of at least 10%).

    paper Recommender for Online Betting

    Objective: To surpass the performance of Amazon Personalize’s Hierarchical Recurrent
    Neural Network-based recommendation system.
    Approach: Adopted a recommender model based on Factorization Machines, significantly
    improving recommendations through targeted feature engineering.

    paper Debt Collection

    Objective: To predict the likelihood of loan repayment by bank customers, aiding
    in their segmentation for tailored communication strategies (email, SMS, or phone
    calls).
    Achievement: Created a behavioral scoring machine learning model, now incorporated
    into Receeve’s collection approach.

    paper Investment Prediction

    Objective: To forecast investors’ forthcoming decisions on company investments.
    Approach: Transformed the challenge into a graph-based problem, treating investors
    and companies as nodes. Developed a Graph Neural Network to predict potential
    investor-company connections, integrating this model into StockFink’s prediction
    suite.

    paper Pricing Engine for a US Dispatcher

    Objective: To predict vehicle transportation prices, providing dispatchers with a
    reliable basis for pricing.
    Achievements: Enhanced the machine learning model’s accuracy by approximately
    40% through advanced feature engineering and implementing a two-tiered regression
    strategy, combining residual and standard regression techniques powered by XGBoost.

    paper Sensor Anomaly Detection

    Objective: To detect unusual behavior in a sensor-monitored network.
    Method: Established a baseline machine learning model predicting expected sensor
    readings, using deviations from this model to flag anomalies. Enhanced detection
    accuracy with a Graph Neural Network, analyzing the connectivity between sensors
    and observable areas.

    paper Fraud Detection for a Buy-Now Pay-later Platform

    Objective: Fraud detection.
    Approach: I designed and developed a fraud detection model for a Middle-Eastern
    Buy-Now Pay-Later platform. A critical step for this client involved data preprocessing,
    during which I employed a graph-based approach to identify cliques of
    fraudsters. This was the first successful machine learning project for this client.

    paper Spectral Image Object Detection

    Objective: given a spectral image that depicts the value of the sensors’s readings,
    classify the signals and identify novelties (anomalies).
    Approach: Given that there was not sufficient labeled data, I had to rely on selfsupervised
    machine learning paradigms. To satisfy the customer’s request for fast
    processing, I utilized one of the YOLO architectures. The system was developed
    with the mmyolo framework.

    paper RAG (LLM)

    Objective: an LLM chatbot to help with HR-related inquiries.
    Approach: I led the development of an advanced Retrieval-Augmented Generation
    (RAG) system aimed at improving HR data retrieval processes. This system utilizes
    the Milvus Vector Similarity Search Database and OpenAI’s API to efficiently source
    and integrate extensive HR-related data, enabling it to respond to a broad spectrum
    of HR inquiries. We have also attempted to fine-tune an LLM for this task, and the
    technologies used were PEFT (LoRA), and sophisticated Prompt Engineering.