I am a Data Scientist and AI Data Quality Specialist with a strong foundation in statistics, mathematics, and computational methods. I bring hands-on experience in statistical modeling, data pipelines, and AI model training through large-scale annotation and pairwise evaluation. A fast learner by nature, I adapt quickly to new tools and environments and am open to diverse opportunities beyond my core specialization. I am committed to delivering high-integrity, evidence-based solutions wherever my skills are needed.

Colleta Aaliyah

I am a Data Scientist and AI Data Quality Specialist with a strong foundation in statistics, mathematics, and computational methods. I bring hands-on experience in statistical modeling, data pipelines, and AI model training through large-scale annotation and pairwise evaluation. A fast learner by nature, I adapt quickly to new tools and environments and am open to diverse opportunities beyond my core specialization. I am committed to delivering high-integrity, evidence-based solutions wherever my skills are needed.

Next availability:
April 21, 2026

I am a Data Scientist and AI Data Quality Specialist with a strong foundation in statistics, mathematics, and computational methods. I bring hands-on experience in statistical modeling, data pipelines, and AI model training through large-scale annotation and pairwise evaluation. A fast learner by nature, I adapt quickly to new tools and environments and am open to diverse opportunities beyond my core specialization. I am committed to delivering high-integrity, evidence-based solutions wherever my skills are needed.

See more

Language

English
Fluent

Work Experience

AI Data Specialist at Train AI Community- RWS group, Prolific, Data Annotation
January 14, 2025 - Present
• Elevated AI output quality by executing rigorous pairwise evaluations and feedback, directly optimizing model alignment and response relevance. • Expedited ML training cycles by processing large-scale multimodal datasets, transforming raw data into model-ready training assets. • Minimized training failures by maintaining 100% annotation accuracy, eliminating costly data re-processing and ensuring dataset integrity.
Data scientist at Tech Camp Kenya
November 13, 2023 - January 7, 2025
• Streamlined data pipelines using Python/Pandas to eliminate 95% of noise, transforming raw datasets into high-integrity assets for analysis. • Accelerated executive decision-making by translating complex data into visualizations that drove high-stakes investment and resource allocation. • Maximized asset uptime by developing predictive models to optimize maintenance schedules and significantly reduce unplanned outages
Data Scientist at TechCamp Kenya
November 1, 2023 - January 1, 2025
Engineered Python/Pandas data pipelines, developed executive-level dashboards and data visualizations, built predictive maintenance models, and applied statistical modeling to large datasets to support strategic decisions.
Data Quality Specialist (Freelance) at Train AI Community – RWS Group, Prolific, Data Annotation
January 1, 2025 - Present
Elevated AI model output quality through rigorous pairwise evaluations and targeted feedback; processed large-scale multimodal datasets into model-ready training assets; achieved and maintained 100% annotation accuracy; collaborated with cross-functional AI research teams to uphold data quality standards.

Education

Bachelor of Science in astronomy and astrophysics at University of Nairobi
August 31, 2020 - July 19, 2024
Bachelor of Science in Astronomy and Astrophysics at University of Nairobi
August 1, 2020 - December 1, 2024

Qualifications

AI Augmented Professional Development Skills – ALX Kenya
December 1, 2024 - April 20, 2026
Certificate in Data Science – TechCamp Kenya
November 1, 2023 - April 20, 2026

Industry Experience

Computers & Electronics, Energy & Utilities, Non-Profit Organization, Professional Services, Software & Internet