Available to hire
I am a data scientist based in Vancouver with 5+ years of experience delivering end-to-end AI/ML deployments, data pipelines, and information retrieval improvements in archival and education settings. I enjoy building scalable systems that turn messy data into actionable insights and reliable AI-powered workflows.
Over the years I’ve driven measurable impact, including a 40% improvement in retrieval accuracy through RAG architectures and a 75% reduction in manual workloads via automation tools. I’ve contributed to 10+ peer-reviewed publications and thrive collaborating across teams to translate domain needs into practical, production-ready solutions.
Skills
Language
English
Fluent
Spanish; Castilian
Fluent
Work Experience
Research Assistant: Deep Learning & NLP at The University of British Columbia
December 1, 2021 - October 30, 2025Architected production-grade RAG systems leveraging LangChain and Ollama, enabling more reliable information retrieval and a 40% improvement in accuracy for research applications. Designed end-to-end data pipelines for multilingual text corpus ETL, enabling scalable workflows and reducing data bottlenecks for model training. Orchestrated distributed processing of 10 TB multilingual data (text/image/audio) via PySpark, supporting research in machine translation, image QA, LLM training, and audio ASR. Engineered scalable web-scraping infrastructure across 500+ domains to diversify datasets and improve Arabic NLP research.
ML Engineering Consultant at InterPares ITrust AI
November 1, 2022 - October 30, 2025Led ML projects to detect diplomatic elements and PII in digitized archives, enabling secure data handling and boosting document extraction accuracy from 40% to 95% using a state-of-the-art Nougat OCR model. Architected an end-to-end RAG system and fine-tuned embedding models with hard-negative mining to improve document understanding and retrieval precision for sensitive records. Streamlined model evaluation by designing Label Studio annotation projects and self-evaluation workflows, reducing the need for specialized labeling and accelerating iteration cycles.
Operations Specialist - Data Engineering and Detection at HSBC
September 1, 2019 - September 1, 2019Streamlined document-handling processes, reducing turnaround time by 30% for small-business accounts while maintaining confidentiality. Maintained 99% data integrity by validating and entering data into the Integrated Data Environment (IDE), enabling precise financial reporting. Conducted risk analysis on client profiles and transactions to flag potential issues and ensure regulatory compliance.
KYC Data Analyst at HSBC
June 1, 2018 - June 1, 2018Automated KYC workflows via VBA macros in MS Excel for data collection, reducing manual processing time by 60%. Analyzed client data via CLI to generate risk-profile dashboards supporting due-diligence decisions. Produced regular reporting on client onboarding metrics and risk trends to drive process improvements.
Education
Data Science Diploma at BrainStation
February 1, 2021 - May 1, 2021Bachelor of Science in Applied Mathematics at SUNY Buffalo State University
August 1, 2013 - May 1, 2017Qualifications
DeepLearning AI Data Engineering Specialization
January 1, 2025 - October 30, 2025Deep Learning Specialization
January 1, 2023 - October 30, 2025Industry Experience
Education, Media & Entertainment, Professional Services
Skills
Hire a Data Scientist
We have the best data scientist experts on Twine. Hire a data scientist in Vancouver today.