I am a PhD-level AI specialist and data annotation expert with 8+ years of experience delivering high-impact work across the full AI training and evaluation stack — from multimodal annotation and LLM evaluation to ML engineering, generative AI assessment, and responsible AI research. I have contributed to frontier AI systems for the world's leading labs, consistently maintaining 95–98% first-submission accuracy across 300+ monthly tasks spanning text, image, video, audio, 3D, and code modalities. I combine deep technical expertise in machine learning, NLP, computer vision, and software engineering with the domain breadth — mathematics, physics, data science, cybersecurity, and STEM — that the most demanding AI training projects require. I am trusted with sensitive, high-stakes evaluation tasks including RLHF, red teaming, safety assessment, hallucination detection, and agent trajectory review across 20+ frontier models. I thrive in remote, asynchronous environments and deliver consistently without supervision.

PAUL MWAMBURI

I am a PhD-level AI specialist and data annotation expert with 8+ years of experience delivering high-impact work across the full AI training and evaluation stack — from multimodal annotation and LLM evaluation to ML engineering, generative AI assessment, and responsible AI research. I have contributed to frontier AI systems for the world's leading labs, consistently maintaining 95–98% first-submission accuracy across 300+ monthly tasks spanning text, image, video, audio, 3D, and code modalities. I combine deep technical expertise in machine learning, NLP, computer vision, and software engineering with the domain breadth — mathematics, physics, data science, cybersecurity, and STEM — that the most demanding AI training projects require. I am trusted with sensitive, high-stakes evaluation tasks including RLHF, red teaming, safety assessment, hallucination detection, and agent trajectory review across 20+ frontier models. I thrive in remote, asynchronous environments and deliver consistently without supervision.

Available to hire

I am a PhD-level AI specialist and data annotation expert with 8+ years of experience delivering high-impact work across the full AI training and evaluation stack — from multimodal annotation and LLM evaluation to ML engineering, generative AI assessment, and responsible AI research. I have contributed to frontier AI systems for the world’s leading labs, consistently maintaining 95–98% first-submission accuracy across 300+ monthly tasks spanning text, image, video, audio, 3D, and code modalities.

I combine deep technical expertise in machine learning, NLP, computer vision, and software engineering with the domain breadth — mathematics, physics, data science, cybersecurity, and STEM — that the most demanding AI training projects require. I am trusted with sensitive, high-stakes evaluation tasks including RLHF, red teaming, safety assessment, hallucination detection, and agent trajectory review across 20+ frontier models. I thrive in remote, asynchronous environments and deliver consistently without supervision.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
See more

Language

English
Fluent
Swahili
Fluent

Work Experience

AI Data Annotation & Evaluation Specialist at TELUS International AI Data Solutions
January 1, 2024 - Present
Annotate and quality-review 300+ multimedia tasks monthly across text, audio, image, and video modalities, applying structured labeling rules with 95–98% first-submission accuracy. Tasks include NER, sentiment, bounding boxes, segmentation, ASR QA, speaker diarization, RLHF ranking, and video temporal labeling. Perform rigorous self-QA, identify edge cases, and document rubric improvements that reduced rework and reviewer variance.
LLM & Multimodal Annotation Expert at Mindrift / Outlier AI
January 1, 2023 - December 31, 2023
Delivered specialist annotation across audio, video, image, and text modalities with 98% QA compliance; conducted RLHF ranking, response rating, adversarial prompt testing, hallucination detection, and safety/bias evaluation across GPT-4, Claude, Gemini, Llama, and Mistral. Documented recurring ambiguities and contributed to guideline improvements.
Language, Data & Annotation Specialist at CloudFactory
January 1, 2022 - December 31, 2022
Linguistic QA, content taxonomy tagging, NER, PII detection, audio transcription review, and speaker ID labeling across NLP/AI training datasets; met throughput benchmarks with first-submission accuracy; contributed rubric updates to reduce error recurrence.
AI Annotation Mentor & Training Coach at Ajira Digital (Volunteer)
January 1, 2022 - December 31, 2023
Designed and delivered structured training on annotation quality, transcription, edge-case resolution, and calibration for 20+ contributors; raised cohort accuracy by 35% and cut onboarding time by 40%.

Education

PhD, Computer Science — Machine Learning & AI at Harvard University
January 11, 2030 - June 9, 2026
M.Sc., Data Science & AI Engineering at University of South Africa (UNISA)
January 11, 2030 - June 9, 2026
B.Sc., Computer Science at University of Nairobi
January 11, 2030 - June 9, 2026
Certificate, AI & Machine Learning at eMobilis Institute of Technology
January 1, 2023 - June 9, 2026
Certificate, ICT at Taita Taveta National Polytechnic
January 1, 2022 - June 9, 2026
PhD, Computer Science — Machine Learning & AI at Harvard University
January 11, 2030 - June 9, 2026
M.Sc., Data Science & AI Engineering at University of South Africa (UNISA)
January 11, 2030 - June 9, 2026
B.Sc., Computer Science at University of Nairobi
January 11, 2030 - June 9, 2026

Qualifications

Machine Learning Specialization
January 11, 2030 - June 9, 2026
NLP Specialization
January 11, 2030 - June 9, 2026
LLM Prompt Engineering for Developers
January 11, 2030 - June 9, 2026
AI Ethics & Responsible AI
January 11, 2030 - June 9, 2026
Google Data Analytics
January 11, 2030 - June 9, 2026
Generative AI Fundamentals
January 11, 2030 - June 9, 2026
AWS Cloud Practitioner
January 11, 2030 - June 9, 2026
TensorFlow Developer Certificate
January 11, 2030 - June 9, 2026
Machine Learning Specialization
January 11, 2030 - June 9, 2026
NLP Specialization
January 11, 2030 - June 9, 2026
LLM Prompt Engineering for Developers
January 11, 2030 - June 9, 2026
AI Ethics & Responsible AI
January 11, 2030 - June 9, 2026
Google Data Analytics
January 11, 2030 - June 9, 2026
Generative AI Fundamentals
January 11, 2030 - June 9, 2026
AWS Cloud Practitioner
January 11, 2030 - June 9, 2026
TensorFlow Developer Certificate
January 11, 2030 - June 9, 2026

Industry Experience

Software & Internet, Professional Services, Media & Entertainment, Education