I’m Arthur Santana, a PhD in Computational Linguistics with extensive hands-on experience in NLP, language data QA, and linguistics-driven AI development. I design grammars, annotate linguistic data, and build NLU/NLG systems, always aiming to improve language technology’s accuracy and usability. Over the years I’ve collaborated with Google Brazil, Telus Digital, Cerence, and academic teams across Brazil and the US, leading training, quality control, and data-driven improvements for multilingual NLP products. I’m passionate about bridging research and product, mentoring teams, and turning linguistic insight into robust AI features.

Arthur Santana

I’m Arthur Santana, a PhD in Computational Linguistics with extensive hands-on experience in NLP, language data QA, and linguistics-driven AI development. I design grammars, annotate linguistic data, and build NLU/NLG systems, always aiming to improve language technology’s accuracy and usability. Over the years I’ve collaborated with Google Brazil, Telus Digital, Cerence, and academic teams across Brazil and the US, leading training, quality control, and data-driven improvements for multilingual NLP products. I’m passionate about bridging research and product, mentoring teams, and turning linguistic insight into robust AI features.

Available to hire

I’m Arthur Santana, a PhD in Computational Linguistics with extensive hands-on experience in NLP, language data QA, and linguistics-driven AI development. I design grammars, annotate linguistic data, and build NLU/NLG systems, always aiming to improve language technology’s accuracy and usability.

Over the years I’ve collaborated with Google Brazil, Telus Digital, Cerence, and academic teams across Brazil and the US, leading training, quality control, and data-driven improvements for multilingual NLP products. I’m passionate about bridging research and product, mentoring teams, and turning linguistic insight into robust AI features.

See more

Experience Level

Expert
Intermediate
Intermediate

Language

Portuguese
Fluent
English
Fluent

Work Experience

Quality Controller at Telus Digital
June 1, 2023 - Present
Lead quality assurance for language data projects, ensuring accuracy, consistency, and alignment with client specifications; collaborate with project managers and subject matter experts to refine task guidelines and resolve quality issues; conduct audits and performance reviews of annotators and re viewers; analyze error patterns and generate insights to improve data quality and team efficiency; support onboarding and training; create, update, and edit annotation and QA guidelines; use quality tracking tools and dashboards to report on KPIs, monitor SLA compliance, and drive continuous improvement.
Senior Linguist at Google Brazil (via Telus Digital)
June 1, 2025 - October 10, 2025
Support the Project Manager on management tasks; perform review of work authored by Linguists; elaborate and conduct training activities for Linguists and new hires; provide extensive linguistic expertise for developing an AI product using NLP techniques; develop complex grammars for NLU; create templates for NLG; evaluate current system outputs; annotate and review linguistic data.
Associate Linguist at Google Brazil (via Telus Digital)
September 1, 2023 - October 10, 2025
NLU grammar development; NLG development; elaborate guidelines for NLP projects; annotate and review linguistic data; provide linguistic analysis and support for NLU and NLG projects; response modeling; label text for disambiguation, expansion, and text normalization; annotating lexicon entries according to guidelines; deriving NLP data.
NLP Language Developer at Cerence (via Maize)
August 1, 2024 - October 10, 2025
Developed NLP components: write rules, build grammars/FSTs; contributed to the design of new features by extending existing annotation schemas to cover new areas; used modeling tools to bootstrap and test new functionalities; created annotation systems and mappings for Portuguese; analyzed system performance and identified areas for improvement.
Computational Linguist at Portuguese International Institute
July 1, 2022 - October 10, 2025
Supported the Scientific and Technical Portuguese Terminologies platform development; data cleaning and wrangling; flexional paradigm script design.
English Teacher at Cultura Inglesa
December 31, 2013 - October 10, 2025
English teacher teaching language to professionals; responsible for curriculum delivery and student progress.
English Teacher at English for Business
December 31, 2017 - October 10, 2025
English teacher focusing on business communication and professional language skills.
Teacher's Assistant at University of São Paulo (USP)
December 31, 2019 - October 10, 2025
Teaching assistant supporting linguistics coursework and student projects.

Education

PhD in Computational Linguistics at University of São Paulo
January 1, 2015 - January 1, 2019
Visiting PhD Scholar at University of Southern California
January 1, 2017 - January 1, 2018
MA in Linguistics at University of São Paulo
January 1, 2013 - January 1, 2015
BA in Letters at Federal University of Maranhão
January 1, 2008 - January 1, 2013

Qualifications

PhD Grant - National Council for Scientific and Technological Development (CNPq)
January 1, 2015 - December 31, 2019
International Visiting Scholar Grant - CNPq
January 1, 2017 - December 31, 2018
MA Scholarship - University of São Paulo
January 1, 2013 - January 1, 2015

Industry Experience

Software & Internet, Education, Media & Entertainment, Professional Services, Other