I am a hybrid Data Scientist and Data Engineer with 8+ years of experience designing, building, and deploying end-to-end data, analytics, and AI solutions in production. I specialize in scalable data platforms on AWS, Spark, and Hadoop, and I apply ML, NLP, and RAG techniques to deliver measurable business outcomes across finance, real estate, telecom, and manufacturing. I enjoy leading cross-functional teams, turning complex data into actionable insights, and continuously improving MLOps practices to ensure reliable, auditable models. I’m motivated by solving real-world problems and helping organizations unlock the value of their data.

Rafael Augusto Gutierrez Noriega

I am a hybrid Data Scientist and Data Engineer with 8+ years of experience designing, building, and deploying end-to-end data, analytics, and AI solutions in production. I specialize in scalable data platforms on AWS, Spark, and Hadoop, and I apply ML, NLP, and RAG techniques to deliver measurable business outcomes across finance, real estate, telecom, and manufacturing. I enjoy leading cross-functional teams, turning complex data into actionable insights, and continuously improving MLOps practices to ensure reliable, auditable models. I’m motivated by solving real-world problems and helping organizations unlock the value of their data.

Available to hire

I am a hybrid Data Scientist and Data Engineer with 8+ years of experience designing, building, and deploying end-to-end data, analytics, and AI solutions in production. I specialize in scalable data platforms on AWS, Spark, and Hadoop, and I apply ML, NLP, and RAG techniques to deliver measurable business outcomes across finance, real estate, telecom, and manufacturing.

I enjoy leading cross-functional teams, turning complex data into actionable insights, and continuously improving MLOps practices to ensure reliable, auditable models. I’m motivated by solving real-world problems and helping organizations unlock the value of their data.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
See more

Language

Spanish; Castilian
Fluent
English
Advanced

Work Experience

Principal Data Scientist at Centro Tecnológico del Notariado
May 1, 2023 - Present
Analytical and technical leader for the Notarial Real Estate Information Portal on AWS. Designed analytical layers processing millions of notarized real estate transactions; developed the first automated real estate valuation model (AVM) using historical data; integrated OpenStreetMap geospatial data to enrich property-level analyses; explored RAG frameworks to support an assisted business intelligence layer with guided insights and natural language interactions.
Consultant Advanced Analytics at Axis corporate
May 1, 2022 - April 30, 2023
Extracted key information from real estate contracts using NLP and conversational AI; built a Power BI visualization layer for commissions, profitability and growth; developed a forecasting engine using Prophet in Python and AWS SageMaker; contributed to budgeting and planning improvements for a major airline and other clients.
Senior Data Scientist at Minsait
May 1, 2021 - May 31, 2022
Designed and built a business intelligence layer for the Finance controlling office; automated monthly sales performance reports; managed analytics projects using agile methodologies; collaborated with Data Engineers to implement MLOps and DataOps for a large bank; performed customer segmentation and NLP-driven insights to optimize marketing and engagement.
Big data and Advanced Analytics Consultant at SDG - Barcelona
July 1, 2019 - May 31, 2021
Developed analytical solutions for the banking sector, including a credit card transaction risk scoring model; led a team of data scientists and engineers; designed ETL pipelines and reports using PL SQL, Python, PySpark and DataRobot.
Data Scientist at everis
June 1, 2018 - June 30, 2019
Created machine learning models to detect fraud for international calls for Movistar; implemented real-time fraud detection via Spark Streaming on a Big Data Cluster; achieved around 40% improvement over existing decision rules.
Statistical Specialist / Senior Analyst / SAS Consultant
January 1, 2002 - December 31, 2016
Led large-scale predictive modeling and optimization initiatives in mining, energy and finance; delivered Six Sigma, Lean and Theory of Constraints projects improving production efficiency and reducing costs; migrated from SAS/Oracle to big data platforms and trained regional teams.

Education

Statistics at National University of Colombia, Faculty of Science
January 11, 2030 - January 1, 2006
Master in Intelligent and Interactive Systems at University of Pompeu Fabra
January 11, 2030 - January 1, 2025

Qualifications

Practical Method to Estimate the Confidence Intervals for the Mean for the Relative Error using the Bootstrap Methodology
January 1, 2007 - January 13, 2026

Industry Experience

Financial Services, Real Estate & Construction, Telecommunications, Manufacturing, Software & Internet, Media & Entertainment