I am a data scientist with 4 years of experience developing and deploying AI-driven solutions, specializing in Natural Language Processing (NLP). I have worked across the full AI lifecycle, from data acquisition and preprocessing to model development, optimization, deployment, and monitoring. I enjoy building transformer-based NLP models, implementing fine-tuning and prompt-engineering strategies, and integrating AI systems into production environments. I am passionate about leveraging deep learning, generative AI, and scalable data pipelines to create language-aware systems that deliver measurable impact.

Ahmet Emin Tek

I am a data scientist with 4 years of experience developing and deploying AI-driven solutions, specializing in Natural Language Processing (NLP). I have worked across the full AI lifecycle, from data acquisition and preprocessing to model development, optimization, deployment, and monitoring. I enjoy building transformer-based NLP models, implementing fine-tuning and prompt-engineering strategies, and integrating AI systems into production environments. I am passionate about leveraging deep learning, generative AI, and scalable data pipelines to create language-aware systems that deliver measurable impact.

Available to hire

I am a data scientist with 4 years of experience developing and deploying AI-driven solutions, specializing in Natural Language Processing (NLP).

I have worked across the full AI lifecycle, from data acquisition and preprocessing to model development, optimization, deployment, and monitoring. I enjoy building transformer-based NLP models, implementing fine-tuning and prompt-engineering strategies, and integrating AI systems into production environments. I am passionate about leveraging deep learning, generative AI, and scalable data pipelines to create language-aware systems that deliver measurable impact.

See more

Language

English
Fluent
Turkish
Fluent

Work Experience

Data Scientist at John Snow Labs
January 1, 2021 - Present
Trained and deployed AI/NLP models across healthcare NLP stack: developed NER models to extract entities, relation extraction, text classification for data categorization, PHI de-identification, and entity resolution mapping to clinical codes (RxNorm, ICD10, SNOMED, ICDO, NDC, LOINC, HCPCS, CPT). Prepared data with chunk-mapping for efficient entity alignment and produced pre-trained NLP pipelines for client-specific and library-wide use. Implemented assertion status models to detect presence, absence, or uncertainty of clinical findings. Collaborated with enterprise customers to design and deliver end-to-end NLP solutions, including data pipeline creation, model training/evaluation, and production deployment. Built modular Python components to automate pretrained model integration for faster inference, and integrated custom automation modules into internal library. Guided customer data science teams on SparkNLP usage and created demo applications and notebooks.
Intern Data Scientist at Datajarlabs
March 1, 2021 - August 1, 2021
Assisted in teaching during the Data Science Bootcamp program. Contributed to curriculum and content creation, and evaluated students’ projects for quality and accuracy.

Education

Higher Diploma in Science in Data Analytics for Business - NFQ Level 8 at CCT College Dublin
January 1, 2022 - January 1, 2023
BSc. Psychology at Adnan Menderes University, Aydin, Turkey
January 1, 2016 - January 1, 2020

Qualifications

SparkNLP for Healthcare Data Scientist
January 1, 2021 - January 6, 2026
SparkNLP Data Scientist
January 1, 2021 - January 6, 2026
Data Science Bootcamp
January 1, 2020 - January 6, 2026

Industry Experience

Software & Internet, Healthcare, Life Sciences