Hello,
I’m a Hindi linguist, voice professional, and AI data specialist with experience contributing both OCR-ready Hindi text-image datasets and Hindi/Indian English voice data for AI model training.
On the OCR side, I create and annotate Hindi (Devanagari) text images suitable for training and evaluating OCR systems. This includes clean image capture, accurate ground-truth transcription, and structured annotations aligned with model-training requirements.
On the voice / TTS side, I’m a native Hindi female speaker with extensive experience recording high-quality Hindi and Hinglish speech data for text-to-speech, speech recognition, IVR, and AI voice systems. I’ve delivered enterprise-grade voice datasets following strict audio, consistency, and script-adherence guidelines.
What I offer:
• Hindi OCR datasets – printed text images with verified transcriptions and optional line- or word-level bounding boxes
• Hindi & Hinglish voice datasets – neutral and expressive female voice suitable for TTS and ASR
• Strong understanding of AI data quality standards, accuracy checks, and common OCR/speech failure cases
• Ability to deliver pilot samples first, then scale to larger structured datasets
• Clear documentation and usage-rights clarity for commercial AI training
In addition, I’ve worked as an AI trainer and response evaluator, which helps me align data creation with real model-training needs rather than surface-level annotation.
I’d be happy to share samples or discuss dataset specifications such as duration, annotation format, or delivery structure.
Thank you for your time, and I look forward to collaborating.
Best regards,
Meenakshi Verma
Hindi Linguist | OCR & Voice TTS Data Contributor
Skills
Experience Level
Language
Work Experience
Education
Qualifications
Industry Experience
Skills
Experience Level
Hire a Voiceover Artist
We have the best voiceover artist experts on Twine. Hire a voiceover artist in Noida today.