Smita Sahu

I am a Computational Linguist and NLP Specialist with 6+ years of experience in Machine Learning & Translation, AI evaluation, MT workflow design and multilingual NLP data projects. I am adept at using Python (Pandas, NumPy, JSON), Jupyter Notebook and prompt engineering to evaluate and improve MT systems. I have a strong foundation in linguistics, evaluation metrics and workflow optimization for global-scale language technology solutions. I have proven success in cross-functional teams delivering high-quality AI data and tools for enterprise use.

Available to hire



Language

English: Fluent
Hindi: Fluent
Bengali: Fluent
Portuguese: Intermediate

Work Experience

Independent Contractor
January 1, 2025 - Present
Collaborated on advanced NLP research integrating frameworks into traditional computational pipelines. Applied hybrid methodologies to model semantic ambiguity and polysemy in machine translation and text generation systems. Designed prompt-based testing and evaluation tasks to measure linguistic phenomena using probabilistic NLP models. Produced technical documentation and analytical reports to inform model adjustments in enterprise NLP systems. Used Python and Jupyter Notebooks for data simulation and pattern recognition in linguistically complex datasets.
Computational Linguist at Defined AI
December 1, 2024 - October 14, 2025
Designed and evaluated MT workflows for NLP projects across multilingual and low-resource language pairs. Ran quality evaluations of translation output using custom and automated scoring metrics. Used Python and Jupyter Notebooks for MT data inspection, pre-processing and result reporting. Collaborated across teams to ensure alignment between user needs and AI solutions design. Spearheaded project planning and Agile ceremonies to ensure high delivery cadence. Managed cross-lingual AI dataset initiatives and multilingual AI readiness. Conducted stakeholder onboarding, feedback sessions and usage monitoring.
Linguistic Consultant (Independent Contractor)
January 1, 2025 - Present
Provided multilingual NLP workflow design and evaluation support for enterprise AI data projects. Led cross-functional collaboration with linguists, annotators and engineers to define data requirements, KPIs and evaluation plans. Designed and implemented prompt-based testing and evaluation strategies to improve MT systems and NLP pipelines.
Data Quality Specialist at LDC-IL, CIIL
June 1, 2021 - October 14, 2025
Developed scalable QA strategies for AI datasets across multiple languages. Authored detailed annotation and evaluation guidelines, improving data delivery quality by 30%. Collaborated with R&D to identify and suggest tool improvements for internal evaluation platforms.
Researcher at LDC-IL, CIIL
November 1, 2019 - October 14, 2025
Managed large-scale government data collection and language documentation projects. Coordinated cross-team workflows across linguists, annotators and researchers to ensure timely delivery of speech & text data. Supported user education around data tools and maintained project governance standards.

Education

Bachelor's in English Honors at Karim City College, India
January 1, 2015
Master's in Linguistics at Banaras Hindu University, India
January 1, 2017
Microsoft Certified: Azure AI Fundamentals at Microsoft
January 1, 2023
Google Project Management Professional Certificate at Coursera
January 1, 2024
Certified Scrum Product Owner (CSPO) at Scrum Alliance
January 1, 2024
Product Owner Certification at Udemy
January 1, 2024

Qualifications

Microsoft Certified: Azure AI Fundamentals
January 1, 2023 - October 14, 2025
Google Project Management Professional Certificate
January 1, 2024 - October 14, 2025
Certified Scrum Product Owner (CSPO)
January 1, 2024 - October 14, 2025
Product Owner Certification
January 1, 2024 - October 14, 2025

Industry Experience

Software & Internet, Professional Services, Education, Government, Other, Computers & Electronics, Media & Entertainment