Jose Manuel Diaz Urraco

Available to hire

I am Jose Manuel Diaz Urraco, Freelance Data Engineer since February 2025, with experience across startups, big tech, and consulting (Amazon, Slido/Cisco, Banco Santander). I design and deliver reliable batch and streaming data pipelines and modern data platforms, centralizing fragmented sources into a single source of truth, automating data quality, and enabling BI, ML, and GenAI use cases.

I excel at communicating with technical and non-technical stakeholders in international environments, translating complex data requirements into scalable solutions, and delivering end-to-end data products that empower teams to derive actionable insights and unlock AI opportunities.

Experience Level

Expert

Language

Spanish (Castilian)
Fluent
English
Advanced

Work Experience

Freelance Data Engineer at Self-employed
February 1, 2025 - Present
Design and deliver automated data pipelines and architectures for international clients, mainly in HealthTech and EdTech. Centralize dispersed sources into a single source of truth and automate cleaning and validation so that data stays ready for analytics and AI. Build hybrid batch and streaming stacks using NiFi, Kafka, Spark Streaming, Airflow, dbt, Snowflake/BigQuery, S3/GCS, Dataflow/Dataproc, and Fivetran.
Product Owner (PO) & Lead Maintainer at Omdena (ULog)
September 1, 2025 - December 1, 2025
Led sprint planning and Kanban, aligned cross-functional contributors, clarified requirements, and coordinated onboarding. Established project governance and delivery standards (versioning, CONTRIBUTING/PR templates, privacy policy) and improved CI and repository health with GitHub Actions. Delivered a deterministic log normalization and classification pipeline: JSON Schema contracts and a validation harness, a rule-based classifier with unit and E2E tests, and production components (HTTP service, Lambda packaging, alert sinks, OpenSearch dashboards). Merged 28 PRs and released contracts v1.0.0.
Lead Data Engineer & Lead Backend Engineer at Omdena (CropLogic)
September 1, 2025 - November 1, 2025
Defined the data strategy and stack for a precision agriculture platform using open geospatial datasets (Parquet + SQLite). Implemented geospatial ETL pipelines (ERA5-Land, NDVI, field-level features), including CRS standardization, grid definitions, and field–cell mappings. Bootstrapped the FastAPI backend and CI, defined the metadata base/schema, and implemented APIs (fields, ingestions, features, annotations, assessments, reset/cleaning).
SQL & Python Developer at Quental (Client: Banco Santander)
July 1, 2024 - December 1, 2024
Built and maintained SQL processes by composing PL/pgSQL functions for analytical workflows. Extracted data from SQL tables using Python (psycopg2, SQLAlchemy) to support downstream pipelines and analysis.
AI Developer at Quental (Client: Banco Santander)
February 1, 2024 - July 1, 2024
Built generative AI applications in Python using Azure OpenAI; delivered UIs in Streamlit. Used Selenium/BeautifulSoup for web scraping when required. Trained computer vision models (classification, segmentation, object detection, OCR) with TensorFlow/Keras.
Business Intelligence Engineer at Amazon (EU Supply Chain)
September 1, 2023 - December 1, 2023
Built an automated reporting system generating and distributing Excel reports via email using AWS (EC2, S3, Lambda, Glue) and internal tooling. Migrated key calculations from Excel to SQL (CTEs) and delivered a QuickSight dashboard to monitor model inputs/outputs. Refactored optimization codebase: S3/Redshift I/O, TOML configuration, parallel processing (MPire), improved code quality (docstrings, typing, Black, Flake8).
Data Scientist at Slido (Cisco)
January 1, 2023 - June 1, 2023
Fine-tuned large language models for video message summarization and evaluated candidates for production-oriented distillation.
Data Engineer at Slido (Cisco)
June 1, 2022 - December 1, 2022
Executed a CRM-to-DWH migration (Gainsight → DWH + Gainsight NXT): extracted raw CRM data, modeled logic in dbt materialized views, and migrated calculations back to Gainsight.
Data Developer at DBpedia
May 1, 2021 - August 1, 2021
Google Summer of Code 2021: built the “DBpedia-Spotlight Dashboard”, combining Bash-based extraction and transformation with a Python/Plotly analytics dashboard.
Data Visualization Analyst Intern at NTT DATA (Client: Banco Santander)
February 1, 2021 - May 1, 2021
Handled data cleaning, quality, and traceability; contributed to ETL tasks and analytical models; built dashboards in Power BI.
AI Intern at Universidad Politécnica de Madrid (UPM)
February 1, 2021 - May 1, 2021
NLP work on DBpedia Spotlight: integrated Wikidata entities (EN/ES), built evaluation datasets, and developed a Shiny (R) visualization app.

Education

Data Engineering Bootcamp at Le Wagon
February 1, 2025 - August 1, 2025
MSc in Data Science at Universidad Politécnica de Madrid
September 1, 2021 - June 1, 2023
BSc in Computer Engineering at Universidad Politécnica de Madrid
January 1, 2017 - June 1, 2021

Qualifications

AWS Certified Cloud Practitioner
January 11, 2030 - January 20, 2026

Industry Experience

Software & Internet, Education, Healthcare, Media & Entertainment, Professional Services
