Hi, I’m Lubhansh Sharma. I have solid experience in AI response evaluation, factuality analysis, and reinforcement learning from human feedback (RLHF). I am passionate about contributing to the development of more accurate, ethical, and human-aligned AI systems. I enjoy ensuring that AI-generated content aligns with human expectations and is factually correct. Throughout my career, I’ve worked on training reward models and providing structured feedback to improve large language model behavior. I look forward to growing into more advanced roles within AI research and model alignment, continuously refining my analytical skills and advancing AI technologies in meaningful ways.

Lubhansh Sharma

Add your roles

Hi, I’m Lubhansh Sharma. I have solid experience in AI response evaluation, factuality analysis, and reinforcement learning from human feedback (RLHF). I am passionate about contributing to the development of more accurate, ethical, and human-aligned AI systems. I enjoy ensuring that AI-generated content aligns with human expectations and is factually correct. Throughout my career, I’ve worked on training reward models and providing structured feedback to improve large language model behavior. I look forward to growing into more advanced roles within AI research and model alignment, continuously refining my analytical skills and advancing AI technologies in meaningful ways.

Available to hire
See more

Language

English
Advanced
Hindi
Advanced

Work Experience

A.I Response Evaluator (Freelancer) at Outlier AI
January 1, 2023 - August 31, 2023
Contributed to the evaluation and fine-tuning of large language models by providing high-quality human feedback on AI-generated responses. Focused on factuality assessment, coherence checking, and preference ranking to support Reinforcement Learning from Human Feedback (RLHF) and improve model alignment with human expectations.
Data Annotator (Full-Time) at Parallel Dots Technologies
January 1, 2021 - March 31, 2022
Experience in accurately labeling and categorizing large datasets to train AI and machine learning models. Worked with text, image, audio, and video annotation, ensuring high-quality data that enhances model accuracy. Responsibilities included following detailed annotation guidelines, performing quality checks, and collaborating with AI researchers to refine training datasets.
Advance AI Data Trainer (Full-Time) at Invisible Technologies
January 1, 2022 - March 31, 2022
Worked on Google's Gemini project focusing on Reward Model Training and Reinforcement Learning from Human Feedback (RLHF). Responsibilities included evaluating AI-generated responses for factual accuracy, coherence, and relevance, and providing high-quality human feedback to fine-tune large language models' performance.

Education

B.Tech at Dr. A.P.J Abdul Kalam Technical University
January 1, 2022 - December 31, 2025
Polytechnic Diploma at Board of Technical Education Uttar Pradesh
January 1, 2019 - December 31, 2022

Qualifications

Data Analytics
January 1, 2021 - December 31, 2021
Machine Learning
January 1, 2021 - December 31, 2021
Python Programming
January 1, 2021 - December 31, 2021
Data Science
January 1, 2021 - December 31, 2021

Industry Experience

Software & Internet, Professional Services, Computers & Electronics

Hire a Freelancer

We have the best experts on Twine. Hire a freelancer in Agra today.