I am a highly driven researcher with a background in philosophy and a strong theoretical curiosity about reinforcement learning. My path in machine learning and data science has drawn me toward RL, particularly RLHF, because it sits at the intersection of environment modeling and human judgment. During my studies I explored large parts of the RL literature, including Barton and Sutton, and followed online coursework on DP-methods for solving MDPs. I am eager for an opportunity to deepen my learning in RLHF to bridge my knowledge with the needs of a team or project. In practice, I have worked with human annotations, dataset curation, and RAG concepts. At SODAS (University of Copenhagen) I created LLM-based annotations for a study on cooperation behavior, translating plain speech into labels {0, 1, -1}. My BA thesis examined evaluation metrics for machine translation, highlighting how high-quality human annotations drive model robustness and how human preferences can guide model outputs. I also augmented an existing dataset with AI-generated paraphrases to study data generation and labeling scrutiny, and I built a small FAISS-based vector store to explore RAG workflows using my own writings.

Malte Ro Buchwald

PRO

I am a highly driven researcher with a background in philosophy and a strong theoretical curiosity about reinforcement learning. My path in machine learning and data science has drawn me toward RL, particularly RLHF, because it sits at the intersection of environment modeling and human judgment. During my studies I explored large parts of the RL literature, including Barton and Sutton, and followed online coursework on DP-methods for solving MDPs. I am eager for an opportunity to deepen my learning in RLHF to bridge my knowledge with the needs of a team or project. In practice, I have worked with human annotations, dataset curation, and RAG concepts. At SODAS (University of Copenhagen) I created LLM-based annotations for a study on cooperation behavior, translating plain speech into labels {0, 1, -1}. My BA thesis examined evaluation metrics for machine translation, highlighting how high-quality human annotations drive model robustness and how human preferences can guide model outputs. I also augmented an existing dataset with AI-generated paraphrases to study data generation and labeling scrutiny, and I built a small FAISS-based vector store to explore RAG workflows using my own writings.

Available to hire

I am a highly driven researcher with a background in philosophy and a strong theoretical curiosity about reinforcement learning. My path in machine learning and data science has drawn me toward RL, particularly RLHF, because it sits at the intersection of environment modeling and human judgment. During my studies I explored large parts of the RL literature, including Barton and Sutton, and followed online coursework on DP-methods for solving MDPs. I am eager for an opportunity to deepen my learning in RLHF to bridge my knowledge with the needs of a team or project.

In practice, I have worked with human annotations, dataset curation, and RAG concepts. At SODAS (University of Copenhagen) I created LLM-based annotations for a study on cooperation behavior, translating plain speech into labels {0, 1, -1}. My BA thesis examined evaluation metrics for machine translation, highlighting how high-quality human annotations drive model robustness and how human preferences can guide model outputs. I also augmented an existing dataset with AI-generated paraphrases to study data generation and labeling scrutiny, and I built a small FAISS-based vector store to explore RAG workflows using my own writings.

See more

Experience Level

Expert
Expert
Expert
Intermediate
Intermediate

Language

Danish
Fluent
English
Advanced
French
Intermediate

Work Experience

Student Assistant at SODAS, Københavns Universitet
January 1, 2024 - Present
Created LLM-based annotations for a dataset used for a study of cooperation behavior, mapping from plain speech to labels {0, 1, -1}. Also contributed to fine-tuning an LLM during the position.

Education

Matematisk studentereksamen at Falkonergaarden
January 1, 2004 - January 1, 2004
BA i Filosofi at Københavns Universitet
January 1, 2005 - January 1, 2008
Master i Filosophy at Københavns Universitet
January 1, 2008 - January 1, 2011
Udveksling at Sorbonne-Paris-IV, Institut for Politisk Filosofi
January 1, 2009 - January 1, 2010
BA in Machine Learning og Datascience at Københavns Universitet
January 1, 2021 - January 1, 2025

Qualifications

Add your qualifications or awards here.

Industry Experience

Education, Software & Internet, Professional Services, Media & Entertainment, Other