Looks like you have JavaScript disabled. For the full Twine experience, you will need to re-enable it.

I am a PhD-level AI specialist and data annotation expert with 8+ years of experience delivering high-impact work across the full AI training and evaluation stack — from multimodal annotation and LLM evaluation to ML engineering, generative AI assessment, and responsible AI research. I have contributed to frontier AI systems for the world's leading labs, consistently maintaining 95–98% first-submission accuracy across 300+ monthly tasks spanning text, image, video, audio, 3D, and code modalities. I combine deep technical expertise in machine learning, NLP, computer vision, and software engineering with the domain breadth — mathematics, physics, data science, cybersecurity, and STEM — that the most demanding AI training projects require. I am trusted with sensitive, high-stakes evaluation tasks including RLHF, red teaming, safety assessment, hallucination detection, and agent trajectory review across 20+ frontier models. I thrive in remote, asynchronous environments and deliver consistently without supervision.…I am a PhD-level AI specialist and data annotation expert with 8+ years of experience delivering high-impact work across the full AI training and evaluation stack — from multimodal annotation and LLM evaluation to ML engineering, generative AI assessment, and responsible AI research. I have contributed to frontier AI systems for the world's leading labs, consistently maintaining 95–98% first-submission accuracy across 300+ monthly tasks spanning text, image, video, audio, 3D, and code modalities. I combine deep technical expertise in machine learning, NLP, computer vision, and software engineering with the domain breadth — mathematics, physics, data science, cybersecurity, and STEM — that the most demanding AI training projects require. I am trusted with sensitive, high-stakes evaluation tasks including RLHF, red teaming, safety assessment, hallucination detection, and agent trajectory review across 20+ frontier models. I thrive in remote, asynchronous environments and deliver consistently without supervision.

PAUL MWAMBURI

Data Annotator, AI Engineer, Data Scientist, +2





I am a PhD-level AI specialist and data annotation expert with 8+ years of experience delivering high-impact work across the full AI training and evaluation stack — from multimodal annotation and LLM evaluation to ML engineering, generative AI assessment, and responsible AI research. I have contributed to frontier AI systems for the world's leading labs, consistently maintaining 95–98% first-submission accuracy across 300+ monthly tasks spanning text, image, video, audio, 3D, and code modalities. I combine deep technical expertise in machine learning, NLP, computer vision, and software engineering with the domain breadth — mathematics, physics, data science, cybersecurity, and STEM — that the most demanding AI training projects require. I am trusted with sensitive, high-stakes evaluation tasks including RLHF, red teaming, safety assessment, hallucination detection, and agent trajectory review across 20+ frontier models. I thrive in remote, asynchronous environments and deliver consistently without supervision.…I am a PhD-level AI specialist and data annotation expert with 8+ years of experience delivering high-impact work across the full AI training and evaluation stack — from multimodal annotation and LLM evaluation to ML engineering, generative AI assessment, and responsible AI research. I have contributed to frontier AI systems for the world's leading labs, consistently maintaining 95–98% first-submission accuracy across 300+ monthly tasks spanning text, image, video, audio, 3D, and code modalities. I combine deep technical expertise in machine learning, NLP, computer vision, and software engineering with the domain breadth — mathematics, physics, data science, cybersecurity, and STEM — that the most demanding AI training projects require. I am trusted with sensitive, high-stakes evaluation tasks including RLHF, red teaming, safety assessment, hallucination detection, and agent trajectory review across 20+ frontier models. I thrive in remote, asynchronous environments and deliver consistently without supervision.

Available to hire

I am a PhD-level AI specialist and data annotation expert with 8+ years of experience delivering high-impact work across the full AI training and evaluation stack — from multimodal annotation and LLM evaluation to ML engineering, generative AI assessment, and responsible AI research. I have contributed to frontier AI systems for the world’s leading labs, consistently maintaining 95–98% first-submission accuracy across 300+ monthly tasks spanning text, image, video, audio, 3D, and code modalities.

I combine deep technical expertise in machine learning, NLP, computer vision, and software engineering with the domain breadth — mathematics, physics, data science, cybersecurity, and STEM — that the most demanding AI training projects require. I am trusted with sensitive, high-stakes evaluation tasks including RLHF, red teaming, safety assessment, hallucination detection, and agent trajectory review across 20+ frontier models. I thrive in remote, asynchronous environments and deliver consistently without supervision.

Skills

Experience Level

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Expert

Intermediate

Intermediate

Intermediate

AI Collection (Video)

Intermediate

AI Collection (Audio)

Intermediate

Language

English

Fluent

Swahili

Fluent

Work Experience

AI Data Annotation & Evaluation Specialist at TELUS International AI Data Solutions

January 1, 2024 - Present

Annotate and quality-review 300+ multimedia tasks monthly across text, audio, image, and video modalities, applying structured labeling rules with 95–98% first-submission accuracy. Tasks include NER, sentiment, bounding boxes, segmentation, ASR QA, speaker diarization, RLHF ranking, and video temporal labeling. Perform rigorous self-QA, identify edge cases, and document rubric improvements that reduced rework and reviewer variance.

LLM & Multimodal Annotation Expert at Mindrift / Outlier AI

January 1, 2023 - December 31, 2023

Delivered specialist annotation across audio, video, image, and text modalities with 98% QA compliance; conducted RLHF ranking, response rating, adversarial prompt testing, hallucination detection, and safety/bias evaluation across GPT-4, Claude, Gemini, Llama, and Mistral. Documented recurring ambiguities and contributed to guideline improvements.

AI Annotation Mentor & Training Coach at Ajira Digital (Volunteer)

January 1, 2022 - December 31, 2023

Designed and delivered structured training on annotation quality, transcription, edge-case resolution, and calibration for 20+ contributors; raised cohort accuracy by 35% and cut onboarding time by 40%.

Language, Data & Annotation Specialist at CloudFactory

January 1, 2022 - December 31, 2022

Linguistic QA, content taxonomy tagging, NER, PII detection, audio transcription review, and speaker ID labeling across NLP/AI training datasets; met throughput benchmarks with first-submission accuracy; contributed rubric updates to reduce error recurrence.