Looks like you have JavaScript disabled. For the full Twine experience, you will need to re-enable it.

Hi, I’m Lubhansh Sharma. I have solid experience in AI response evaluation, factuality analysis, and reinforcement learning from human feedback (RLHF). I am passionate about contributing to the development of more accurate, ethical, and human-aligned AI systems. I enjoy ensuring that AI-generated content aligns with human expectations and is factually correct. Throughout my career, I’ve worked on training reward models and providing structured feedback to improve large language model behavior. I look forward to growing into more advanced roles within AI research and model alignment, continuously refining my analytical skills and advancing AI technologies in meaningful ways.…Hi, I’m Lubhansh Sharma. I have solid experience in AI response evaluation, factuality analysis, and reinforcement learning from human feedback (RLHF). I am passionate about contributing to the development of more accurate, ethical, and human-aligned AI systems. I enjoy ensuring that AI-generated content aligns with human expectations and is factually correct. Throughout my career, I’ve worked on training reward models and providing structured feedback to improve large language model behavior. I look forward to growing into more advanced roles within AI research and model alignment, continuously refining my analytical skills and advancing AI technologies in meaningful ways.

Lubhansh Sharma

Add your roles











Hi, I’m Lubhansh Sharma. I have solid experience in AI response evaluation, factuality analysis, and reinforcement learning from human feedback (RLHF). I am passionate about contributing to the development of more accurate, ethical, and human-aligned AI systems. I enjoy ensuring that AI-generated content aligns with human expectations and is factually correct. Throughout my career, I’ve worked on training reward models and providing structured feedback to improve large language model behavior. I look forward to growing into more advanced roles within AI research and model alignment, continuously refining my analytical skills and advancing AI technologies in meaningful ways.…Hi, I’m Lubhansh Sharma. I have solid experience in AI response evaluation, factuality analysis, and reinforcement learning from human feedback (RLHF). I am passionate about contributing to the development of more accurate, ethical, and human-aligned AI systems. I enjoy ensuring that AI-generated content aligns with human expectations and is factually correct. Throughout my career, I’ve worked on training reward models and providing structured feedback to improve large language model behavior. I look forward to growing into more advanced roles within AI research and model alignment, continuously refining my analytical skills and advancing AI technologies in meaningful ways.

Available to hire

Skills

AI Data Labelling

Experience Level

Expert

Expert

AI Data Labelling

Intermediate

Language

English

Advanced

Hindi

Advanced

Work Experience

A.I Response Evaluator (Freelancer) at Outlier AI

January 1, 2023 - August 31, 2023

Contributed to the evaluation and fine-tuning of large language models by providing high-quality human feedback on AI-generated responses. Focused on factuality assessment, coherence checking, and preference ranking to support Reinforcement Learning from Human Feedback (RLHF) and improve model alignment with human expectations.

Data Annotator (Full-Time) at Parallel Dots Technologies

January 1, 2021 - March 31, 2022

Experience in accurately labeling and categorizing large datasets to train AI and machine learning models. Worked with text, image, audio, and video annotation, ensuring high-quality data that enhances model accuracy. Responsibilities included following detailed annotation guidelines, performing quality checks, and collaborating with AI researchers to refine training datasets.

Advance AI Data Trainer (Full-Time) at Invisible Technologies

January 1, 2022 - March 31, 2022

Worked on Google's Gemini project focusing on Reward Model Training and Reinforcement Learning from Human Feedback (RLHF). Responsibilities included evaluating AI-generated responses for factual accuracy, coherence, and relevance, and providing high-quality human feedback to fine-tune large language models' performance.

Education

B.Tech at Dr. A.P.J Abdul Kalam Technical University

January 1, 2022 - December 31, 2025

Polytechnic Diploma at Board of Technical Education Uttar Pradesh

January 1, 2019 - December 31, 2022

Qualifications

Data Analytics

January 1, 2021 - December 31, 2021

Machine Learning

January 1, 2021 - December 31, 2021

Python Programming

January 1, 2021 - December 31, 2021

Data Science

January 1, 2021 - December 31, 2021

Industry Experience

Software & Internet, Professional Services, Computers & Electronics

Skills

AI Data Labelling

Experience Level

Expert

Expert

AI Data Labelling

Intermediate

Hire a Freelancer

We have the best experts on Twine. Hire a freelancer in Agra today.

Find a Freelancer