AI Data Annotator and LLM Evaluation Specialist with experience in high-accuracy data labeling, search relevance evaluation, ad quality assessment, and AI-generated content review. Experienced in working with structured guidelines to improve the quality, reliability, and factual accuracy of machine learning training data. I have hands-on experience in evaluating AI model outputs for factual correctness, logical consistency, and usefulness, including identifying hallucinations, incorrect references, and misleading reasoning. I also create realistic domain-specific prompts that simulate real-world user behavior to support LLM training and performance improvement. My work includes scoring and ranking model responses using structured rubrics, providing detailed evidence-based justifications, and ensuring consistency in large-scale annotation tasks. I am highly comfortable working independently in high-volume environments while maintaining strong attention to detail and quality standards. Core competencies include data annotation, LLM evaluation, prompt engineering, search relevance rating, content quality assessment, and AI training data validation. I am skilled in interpreting complex English-language guidelines and translating them into consistent, accurate evaluation outputs. I am actively transitioning into roles focused on AI training data, LLM evaluation, and data quality assurance, with a strong interest in contributing to the development of reliable, safe, and high-performing AI systems.

Eka Nur Fitriyani

AI Data Annotator and LLM Evaluation Specialist with experience in high-accuracy data labeling, search relevance evaluation, ad quality assessment, and AI-generated content review. Experienced in working with structured guidelines to improve the quality, reliability, and factual accuracy of machine learning training data. I have hands-on experience in evaluating AI model outputs for factual correctness, logical consistency, and usefulness, including identifying hallucinations, incorrect references, and misleading reasoning. I also create realistic domain-specific prompts that simulate real-world user behavior to support LLM training and performance improvement. My work includes scoring and ranking model responses using structured rubrics, providing detailed evidence-based justifications, and ensuring consistency in large-scale annotation tasks. I am highly comfortable working independently in high-volume environments while maintaining strong attention to detail and quality standards. Core competencies include data annotation, LLM evaluation, prompt engineering, search relevance rating, content quality assessment, and AI training data validation. I am skilled in interpreting complex English-language guidelines and translating them into consistent, accurate evaluation outputs. I am actively transitioning into roles focused on AI training data, LLM evaluation, and data quality assurance, with a strong interest in contributing to the development of reliable, safe, and high-performing AI systems.

Available to hire

AI Data Annotator and LLM Evaluation Specialist with experience in high-accuracy data labeling, search relevance evaluation, ad quality assessment, and AI-generated content review. Experienced in working with structured guidelines to improve the quality, reliability, and factual accuracy of machine learning training data.

I have hands-on experience in evaluating AI model outputs for factual correctness, logical consistency, and usefulness, including identifying hallucinations, incorrect references, and misleading reasoning. I also create realistic domain-specific prompts that simulate real-world user behavior to support LLM training and performance improvement.

My work includes scoring and ranking model responses using structured rubrics, providing detailed evidence-based justifications, and ensuring consistency in large-scale annotation tasks. I am highly comfortable working independently in high-volume environments while maintaining strong attention to detail and quality standards.

Core competencies include data annotation, LLM evaluation, prompt engineering, search relevance rating, content quality assessment, and AI training data validation. I am skilled in interpreting complex English-language guidelines and translating them into consistent, accurate evaluation outputs.

I am actively transitioning into roles focused on AI training data, LLM evaluation, and data quality assurance, with a strong interest in contributing to the development of reliable, safe, and high-performing AI systems.

See more

Language

Indonesian
Fluent
English
Advanced

Work Experience

Toll In & Toll Out Manufacturing Specialist at PT KIMIA FARMA Tbk.
September 1, 2023 - Present
Ensure all Contract Manufacturing both in Other Company or produce in Kimia Farma Plant. Provide monthly data such as OFF and OTD for all In and Out Contract Manufacturing Product. Count estimate Fee for Contract Manufacturing.
Toll Out Manufacturing Specialist at PT KIMIA FARMA Tbk.
February 1, 2021 - September 1, 2023
Ensure all Contract Manufacturing Kimia Farma in other Company (Out Contract Manufacturing). Provide monthly data such as OFF and OTD for all contract manufacturing Product.
Product Development Specialist at PT KIMIA FARMA Tbk.
February 1, 2020 - January 1, 2021
As Product Development personnel in Packaging Development (Primer, Sekunder, Tersier Package). Review all Art Work and Proof Print. Trial new packaging with R&D to apply it’s use in Plant.
Quality System Intern at PT KALBE FARMA Tbk.
February 1, 2019 - March 1, 2019
In Quality System Department, Learn about Compliance, Document Control and Quality & Training Programme. Handle 2 projects: Find GAP GMP 2018 vs PQ WHO; made an Protocol.
Data Annotator at clickworker
March 1, 2026 - Present
As a Data Annotator, I contributed to improving AI systems and search engine quality through high-precision data labeling and evaluation tasks. My responsibilities included assessing the relevance and quality of search results, images, and online advertisements in English-language projects. I consistently maintained over 80% accuracy while working independently in a high-volume, performance-driven environment. I applied strong critical thinking, attention to detail, and careful interpretation of complex English-language guidelines to ensure consistent and reliable evaluation outcomes.
Haitian Indonesian ASR Project (First Batch) at Beijing Haitian Ruisheng Technology Co., Ltd.
March 1, 2026 - April 30, 2026
Haitian–Indonesian ASR Project (First Batch) | AIDAP Indonesia Recording Contributed to an Automatic Speech Recognition (ASR) data collection project aimed at improving multilingual AI speech models by generating high-quality audio datasets. Performed text-to-speech audio recording tasks following strict pronunciation, clarity, and annotation guidelines Produced clean and consistent audio inputs for ASR model training and evaluation Total recorded duration: 3.77 hours (effective usable audio: 3.22 hours) Ensured data quality standards for machine learning dataset preparation in speech recognition systems Project platform: https://aidap.caih.com/ Key skills: Speech Data Collection, ASR (Automatic Speech Recognition), Audio Annotation, Data Labeling, Linguistic Accuracy, AI Training Data, Quality Control, Attention to Detail

Education

Apt. at Universitas Gadjah Mada
January 1, 2018 - January 1, 2019
Bachelor of Pharmacy (S. Farm) at Universitas Gadjah Mada
January 1, 2014 - January 1, 2018

Qualifications

Add your qualifications or awards here.

Industry Experience

Healthcare, Manufacturing, Life Sciences, Professional Services