I am Masoud Karami, a PhD candidate in Computer Engineering at Polytechnique Montréal. I develop cognitively inspired evaluation frameworks for Large Language Models (LLMs), focusing on robustness, reasoning, and human alignment. I have designed a Python-based LLM Serial Memory Task pipeline to evaluate recall accuracy, forgetting dynamics, and distractor handling, and I have integrated it into real-world software engineering tasks such as automated code review. My work benchmarks over 20 foundation models (e.g., LLaMA-2, LLaMA-3, CodeLlama, Mistral, Phi, Qwen, Claude, GPT-3.5, GPT-4) using large-scale HPC workflows on Compute Canada. I investigate RLHF and prompt engineering (chain-of-thought, step-back prompting) and deliver reproducible, high-performance evaluation workflows.

Masoud Karami

I am Masoud Karami, a PhD candidate in Computer Engineering at Polytechnique Montréal. I develop cognitively inspired evaluation frameworks for Large Language Models (LLMs), focusing on robustness, reasoning, and human alignment. I have designed a Python-based LLM Serial Memory Task pipeline to evaluate recall accuracy, forgetting dynamics, and distractor handling, and I have integrated it into real-world software engineering tasks such as automated code review. My work benchmarks over 20 foundation models (e.g., LLaMA-2, LLaMA-3, CodeLlama, Mistral, Phi, Qwen, Claude, GPT-3.5, GPT-4) using large-scale HPC workflows on Compute Canada. I investigate RLHF and prompt engineering (chain-of-thought, step-back prompting) and deliver reproducible, high-performance evaluation workflows.

Available to hire

I am Masoud Karami, a PhD candidate in Computer Engineering at Polytechnique Montréal. I develop cognitively inspired evaluation frameworks for Large Language Models (LLMs), focusing on robustness, reasoning, and human alignment. I have designed a Python-based LLM Serial Memory Task pipeline to evaluate recall accuracy, forgetting dynamics, and distractor handling, and I have integrated it into real-world software engineering tasks such as automated code review.

My work benchmarks over 20 foundation models (e.g., LLaMA-2, LLaMA-3, CodeLlama, Mistral, Phi, Qwen, Claude, GPT-3.5, GPT-4) using large-scale HPC workflows on Compute Canada. I investigate RLHF and prompt engineering (chain-of-thought, step-back prompting) and deliver reproducible, high-performance evaluation workflows.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert

Language

English
Advanced
French
Advanced

Work Experience

Add your work experience history here.

Education

Ph.D. Candidate in Computer Engineering at Polytechnique Montréal
January 1, 2020 - October 26, 2025

Qualifications

M.Sc. in Applied Mathematics – Thesis: A Heuristic Approach for Solving the Gravity p-Median Problem
January 11, 2030 - October 26, 2025
B.Sc. in Applied Mathematics
January 11, 2030 - October 26, 2025

Industry Experience

Software & Internet, Education, Professional Services, Media & Entertainment, Other