I am a detail-oriented AI machine learning data linguist who thrives on turning complex data into actionable insights. I have led data preprocessing, language data annotation, red-teaming guidelines, and conversations design for multilingual agent models across ecommerce, banking, travel, medical, and insurance domains. I enjoy collaborating with language scientists and engineers, mentoring teammates, and designing scalable data pipelines and editorial content to streamline model evaluation and knowledge bases. I stay current with industry trends and apply practical, user-focused solutions to improve model accuracy, safety, and user experience.

Kelly Teves Pereira

I am a detail-oriented AI machine learning data linguist who thrives on turning complex data into actionable insights. I have led data preprocessing, language data annotation, red-teaming guidelines, and conversations design for multilingual agent models across ecommerce, banking, travel, medical, and insurance domains. I enjoy collaborating with language scientists and engineers, mentoring teammates, and designing scalable data pipelines and editorial content to streamline model evaluation and knowledge bases. I stay current with industry trends and apply practical, user-focused solutions to improve model accuracy, safety, and user experience.

Available to hire

I am a detail-oriented AI machine learning data linguist who thrives on turning complex data into actionable insights. I have led data preprocessing, language data annotation, red-teaming guidelines, and conversations design for multilingual agent models across ecommerce, banking, travel, medical, and insurance domains.

I enjoy collaborating with language scientists and engineers, mentoring teammates, and designing scalable data pipelines and editorial content to streamline model evaluation and knowledge bases. I stay current with industry trends and apply practical, user-focused solutions to improve model accuracy, safety, and user experience.

See more

Language

Portuguese
Fluent
English
Fluent

Work Experience

Lead ML Data Linguist II and Prompt Engineer at Amazon / Alexa Data Services – Virtual for AWS Bedrock
May 1, 2024 - Present
Engineered and optimized prompts for diverse LLM applications including summarization, classification, translation, and complex agent workflows, significantly improving model performance. Developed multilingual prompts using LLM Explorer for large-scale synthetic data generation, while providing critical feedback to language engineering teams contributing to API improvement. Subject matter expert in synthetic data generation leading document pool initiatives for RAG and guardrails implementation, creating comprehensive knowledge bases and establishing benchmarking protocols. Designed and implemented conversational AI frameworks across e-commerce, banking, healthcare, and insurance. Spearheaded technical documentation for AWS Bedrock, resulting in increased customer engagement and revenue growth. Applied advanced linguistic analysis to enhance customer interaction models. Conducted quality tests via Appen backend as project lead, resulting in high quality metrics. Orchestrated data prep
Lead ML Data Associate II at Amazon / Alexa Data Services – Virtual for BOS20
October 1, 2014 - May 1, 2024
Engineered machine learning models achieving 33.3% accuracy improvement in predictive analytics, customer segmentation, and demand forecasting. Developed and implemented data preprocessing workflows for secure critical and non-critical datasets, optimizing model training efficiency. Leveraged advanced ML algorithms to analyze large-scale datasets, extracting actionable customer behavior insights for the en_US market. Created, managed, and deployed editorial content using HTML/XML for global Alexa device interfaces for en_US market, including daily themes and cookbook publications. Conducted comprehensive model performance evaluations, including audio linguistics annotation with IVONA, parameter tuning, and algorithm optimization. Contributed to ML pipeline development focusing on scalability and automated data processing. Participated in UAT, UI, and UX design focus groups, adhering to leadership principles. Collaborated with software engineers to deploy ML models in production environ
Lead ML Data Linguist II and Prompt Engineer at Amazon Web Services (AWS) – Remote (Santa Clara, CA; Seattle, WA; NYC, NY)
May 1, 2024 - Present
Led data preprocessing, data cleaning, and dataset preparation for model training in highly secure environments; designed conversations between users and agent models across ecommerce, banking, travel, medical, and insurance domains; helped develop red-teaming guidelines for chatbot design; conducted QA testing via Appen backend; produced technical writing materials and led a wiki/SharePoint site design team for internal and external AWS Bedrock initiatives; generated multilingual JSON blueprints and analyzed filled PDFs; applied sociolinguistic analysis to understand customer behavior and dialect influence; evaluated AI responses in HTML/XML/JSON and supported RAG and guardrails strategies; mentored new hires and supported production deployments.
Lead ML Data Associate II at Amazon / Artificial Generation Intelligence (AGI) + Alexa Data Services (ADS)
October 1, 2014 - May 1, 2024
Applied advanced ML algorithms to analyze large-scale datasets and derive insights into customer behavior; created editorial content and managed HTML/XML-based content in CMS; scheduled deployments of daily themes and cookbooks on Alexa devices for a global market; performed data annotation across multiple domains; built scalable ML pipelines and participated in UAT/UI/UX focus groups and A/B testing; trained teams on data quality evaluation; contributed to model deployment and production integration; served as Portuguese SME for slang and vocabulary; provided multi-modal labeling and transcription, including photo/video editing and translation support.
Lead ML Data Associate II at Amazon / AGI + Alexa Data Services (ADS)
October 1, 2014 - May 1, 2024
Led data preprocessing and data cleaning to prepare datasets for model training in highly secure environments; designed conversations between users and agent models for ecommerce, banking, travel, medical, and insurance contexts; assisted language data scientists in red-teaming guidelines; conducted QA tests and supported data linguists to achieve high quality metrics; contributed to web and wiki content development; supervised multilingual JSON blueprint generation for model evaluation; provided mentorship and stakeholder-facing updates; managed editorial content and localization efforts.
Lead ML Data Linguist II and Prompt Engineer at Amazon Web Services (AWS) – AGI + Alexa Data Services (ADS)
May 1, 2024 - Present
Developed preprocessing pipelines and agent conversations across ecommerce, banking, travel, medical and insurance domains; led data linguist teams and provided guidance on red-teaming guidelines; performed QA via Appen backend; delivered technical writing and led a website design team with A/B testing for internal and external wiki/sharepoint sites; generated multilingual JSON blueprints and analyzed data for model evaluation; contributed to multilingual prompt design, RAG strategies, guardrails, and model evaluation for en_US market.

Education

Artificial Intelligence Practitioner Learning Plan Certification at Amazon Web Services (AWS)
January 11, 2030 - January 28, 2026
Python and SQL for Machine Learning and Data Engineering at LinkedIn Learning
January 11, 2030 - January 28, 2026
Event coordination and design at ACPWC
January 11, 2030 - January 28, 2026
Associates Degree in Business Office Administration & Transcription at Bristol Community College
January 11, 2030 - January 28, 2026
Associates Degree in Business Office Administration & Transcription at Bristol Community College
January 11, 2030 - March 6, 2026
Portuguese Language Certification at Sylvia Portuguese School
January 11, 2030 - March 6, 2026
AWS Artificial Intelligence Practitioner Certification at Amazon Web Services (AWS)
January 11, 2030 - March 6, 2026
Python and SQL for Machine Learning and Data Engineering at LinkedIn Learning
January 11, 2030 - March 6, 2026
Associates Degree in Business Office Administration & Transcription at Bristol Community College
January 11, 2030 - March 6, 2026
Nursing, Science, and Psychology at Bristol Community College
January 11, 2030 - March 6, 2026
Portuguese Language Certification at Sylvia Portuguese School
January 11, 2030 - March 6, 2026
AWS Artificial Intelligence Practitioner Certification at Amazon Web Services (AWS)
January 11, 2030 - March 6, 2026
Python and SQL for Machine Learning and Data Engineering at LinkedIn Learning
January 11, 2030 - March 6, 2026

Qualifications

Artificial Intelligence Practitioner Learning Plan Certification
January 11, 2030 - January 28, 2026
Python and SQL for Machine Learning and Data Engineering
January 11, 2030 - January 28, 2026
ACPWC Event coordination and design
January 11, 2030 - January 28, 2026
AWS Artificial Intelligence Practitioner Certification
January 11, 2030 - March 6, 2026
Python and SQL for Machine Learning and Data Engineering
January 11, 2030 - March 6, 2026
Portuguese Language Certification
January 11, 2030 - March 6, 2026
AWS Artificial Intelligence Practitioner Certification
January 11, 2030 - March 6, 2026
Python and SQL for Machine Learning and Data Engineering
January 11, 2030 - March 6, 2026
Portuguese Language Certification
January 11, 2030 - March 6, 2026

Industry Experience

Software & Internet, Healthcare, Financial Services, Retail, Professional Services, Media & Entertainment