I'm a Data Scientist with 10+ years of experience across academia and industry, specializing in machine learning and data management in biotechnology, with growing focus on healthcare, e-commerce, and hospitality. I enjoy turning complex data into actionable insights and scalable ML solutions. In academia I presented at international conferences and published in peer-reviewed journals. In industry I have built and deployed deep learning applications (CNNs, LSTMs, NLP models) to boost predictive performance and decision-making; I supported a €1.2M project and contributed to a SaaS platform generating over €500K in stable revenue. I have strong experience in CLV modeling, medical device data analysis, time-series forecasting, anomaly detection, and inventory optimization, with a solid NLP background (LLM fine-tuning, RAG, GraphRAG). I'm comfortable in Agile environments with CI/CD, delivering production-ready ML solutions. Recently at L’Oréal Paris I joined a taskforce to integrate a production NLP pipeline for Japanese e-commerce with the Paris data team and Japan business teams.

Ryoji Takahashi

I'm a Data Scientist with 10+ years of experience across academia and industry, specializing in machine learning and data management in biotechnology, with growing focus on healthcare, e-commerce, and hospitality. I enjoy turning complex data into actionable insights and scalable ML solutions. In academia I presented at international conferences and published in peer-reviewed journals. In industry I have built and deployed deep learning applications (CNNs, LSTMs, NLP models) to boost predictive performance and decision-making; I supported a €1.2M project and contributed to a SaaS platform generating over €500K in stable revenue. I have strong experience in CLV modeling, medical device data analysis, time-series forecasting, anomaly detection, and inventory optimization, with a solid NLP background (LLM fine-tuning, RAG, GraphRAG). I'm comfortable in Agile environments with CI/CD, delivering production-ready ML solutions. Recently at L’Oréal Paris I joined a taskforce to integrate a production NLP pipeline for Japanese e-commerce with the Paris data team and Japan business teams.

Available to hire

I’m a Data Scientist with 10+ years of experience across academia and industry, specializing in machine learning and data management in biotechnology, with growing focus on healthcare, e-commerce, and hospitality. I enjoy turning complex data into actionable insights and scalable ML solutions.

In academia I presented at international conferences and published in peer-reviewed journals. In industry I have built and deployed deep learning applications (CNNs, LSTMs, NLP models) to boost predictive performance and decision-making; I supported a €1.2M project and contributed to a SaaS platform generating over €500K in stable revenue. I have strong experience in CLV modeling, medical device data analysis, time-series forecasting, anomaly detection, and inventory optimization, with a solid NLP background (LLM fine-tuning, RAG, GraphRAG). I’m comfortable in Agile environments with CI/CD, delivering production-ready ML solutions. Recently at L’Oréal Paris I joined a taskforce to integrate a production NLP pipeline for Japanese e-commerce with the Paris data team and Japan business teams.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
See more

Language

Japanese
Fluent
English
Fluent
Spanish; Castilian
Intermediate
Catalan; Valencian
Intermediate

Work Experience

Data Scientist at L’Oréal Paris (Freelancer)
July 1, 2025 - October 1, 2025
Joined a taskforce to design and integrate the end-to-end production NLP pipeline for Japanese e-commerce (reviews, Q&A, social). Collaborated with the Paris data team and the Japan business team to align requirements, SLAs, and KPIs; translated business goals into measurable ML objectives — delivering within BigQuery (GCP), Airflow, Kubernetes, among other tools. Built ingestion and preprocessing tailored to Japanese (tokenization, sentence segmentation), and operationalized sentiment/topic models (Gemini/GPT) at scale.
Data Scientist at Zymvol Biomodeling SL
October 1, 2019 - October 1, 2024
Implemented NLP into the pipeline; built regression/classification models with scikit-learn, NumPy, pandas, PyTorch, and TensorFlow; managed large-scale data pipelines with PySpark (AWS, EC2). Led/mentored junior team members and coordinated delivery in Agile/Scrum (Zoho); explored graph-based approaches (GraphRAG, graph DBs) for entity linking, retrieval, and recommendation-style use cases.
Data Scientist at Basetis
January 1, 2019 - June 1, 2019
Bacteria colony detections from images of Petri dishes using CNN; implemented anomaly detection algorithms in the pipeline.
Data Analyst at CNAG - CRG
April 1, 2016 - December 1, 2018
Worked with RWE and multi-omics data (genomics/transcriptomics/proteomics) in Python & R to identify disease-relevant variants/genes using ML (incl. tensor decomposition), applying controlled vocabularies/ontology-aligned identifiers to harmonize sources and improve data quality across studies.
Data Scientist at Barcelona Supercomputing Center
October 1, 2010 - January 1, 2016
Implementation of the pipeline using Markov models to calculate free energy in Monte Carlo-based simulations. (Java, MATLAB, and Python development)
Postdoctoral Fellowship at University of Tulsa (and collaborators: UNAM, CCT-LSU, TAC)
December 1, 2001 - September 1, 2010
Implementing and demonstrating proofs of concepts of new theories; performing large-scale simulations in HPC.
Data Analyst at CNAG-CRG
April 1, 2016 - December 31, 2018
Worked with real-world evidence and multi-omics data (genomics/transcriptomics/proteomics) in Python & R to identify disease-relevant variants/genes using ML (including tensor decomposition), applying controlled vocabularies/ontology-aligned identifiers to harmonize sources and improve data quality across studies.
Postdoctoral Fellow at University of Tulsa; UNAM; CCT-LSU; TAC
December 1, 2001 - September 30, 2010
Implementing and demonstrating proof of concepts of new theories. Performing large-scale simulations in HPC.

Education

PhD at Albert Einstein Institute – MPI, Germany
January 1, 1997 - January 1, 2001
MS in Materials and Biophysics at Ibaraki University, Japan
January 1, 1996 - January 1, 1997
BS in Physics at International Christian University, Japan
January 1, 1992 - January 1, 1996
PhD at Albert Einstein Institute – MPI, Germany
January 1, 1997 - December 31, 2001
MS at Ibaraki University, Japan
January 1, 1996 - December 31, 1997
BS at International Christian University, Japan
January 1, 1992 - December 31, 1996

Qualifications

Add your qualifications or awards here.

Industry Experience

Healthcare, Life Sciences, Software & Internet, Retail, Travel & Hospitality, Professional Services, Media & Entertainment, Other