I’m a passionate data and software engineer who loves to learn and solve real-world problems at the intersection of data, code, and machine learning. I enjoy exploring how modern DL models, RAG systems, and transformer architectures can unlock value across industries, with a current focus on deep learning, medical image computing, and practical ML deployments. I thrive in building end-to-end systems that scale, from data pipelines to AI-powered applications. In my work, I design robust data platforms and ML-enabled services, combining cloud and on‑prem environments to deliver seamless, scalable solutions. I’m motivated by hands-on experimentation and real-world impact, always looking to level up with new tools and approaches while keeping the focus on maintainable, production-ready architectures.

Javier Garcia Garcia

I’m a passionate data and software engineer who loves to learn and solve real-world problems at the intersection of data, code, and machine learning. I enjoy exploring how modern DL models, RAG systems, and transformer architectures can unlock value across industries, with a current focus on deep learning, medical image computing, and practical ML deployments. I thrive in building end-to-end systems that scale, from data pipelines to AI-powered applications. In my work, I design robust data platforms and ML-enabled services, combining cloud and on‑prem environments to deliver seamless, scalable solutions. I’m motivated by hands-on experimentation and real-world impact, always looking to level up with new tools and approaches while keeping the focus on maintainable, production-ready architectures.

Available to hire

I’m a passionate data and software engineer who loves to learn and solve real-world problems at the intersection of data, code, and machine learning. I enjoy exploring how modern DL models, RAG systems, and transformer architectures can unlock value across industries, with a current focus on deep learning, medical image computing, and practical ML deployments. I thrive in building end-to-end systems that scale, from data pipelines to AI-powered applications.

In my work, I design robust data platforms and ML-enabled services, combining cloud and on‑prem environments to deliver seamless, scalable solutions. I’m motivated by hands-on experimentation and real-world impact, always looking to level up with new tools and approaches while keeping the focus on maintainable, production-ready architectures.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
See more

Language

Spanish; Castilian
Fluent
English
Advanced

Work Experience

Data Engineer at Telefonica TID
April 1, 2025 - Present
Developed and maintained scalable ETL pipelines to process large-scale SIM card activity data in IoT projects, leveraging Apache Spark and Scala for advanced analytics and real-time data processing. Automated infrastructure deployment and configuration management using Ansible to provide robust and consistent environments across projects. Used Python for development of ETL processes, DAGs for workflow orchestration, and data automation scripts, improving efficiency and maintainability of data pipelines. Worked extensively with Google Cloud Platform services (Cloud Storage, cloud clusters, BigQuery), optimizing storage, orchestration of distributed workloads, and integrating cloud and on‑premise systems for seamless big data solutions.
CoLead Data & AI Engineer at MVP Analytics (Freelance) - Generative Content Platform
July 1, 2025 - Present
Data & AI Engineering lead for an ETL-driven system ingesting, transforming, and consolidating data from multiple sources. Architected ETL pipeline orchestrated with Apache Airflow; multi-database setup: ClickHouse for analytical queries, PostgreSQL for metadata and transactional operations, MinIO for object storage, ChromaDB for vector embeddings, leveraging LLMs hosted on AWS with an on‑premise data flow to achieve a seamless RAG-enabled generative platform. Base model LLama 3.1 8B fine-tuned with LoRA adapters and 4-bit quantization to optimize GPU memory and performance. Backend & API layer built with FastAPI; JWT authentication, TOT P-based 2FA, and RBAC; microservices architecture and asynchronous workflows. Frontend integrated with Next.js 15 and TypeScript. Entire solution runs on multi-service Docker Compose setup; persistent volumes and health checks, using Redis for caching, MinIO for storage, and VastAI for GPU inference. End-to-end deployment pipeline.
Data Engineer at Nubika Cloud Solutions
March 1, 2024 - October 16, 2025
Developed data sets and data flows on Salesforce Einstein Analytics (SAQL, SQL). Built a data warehouse and data lake using MuleSoft on Amazon S3 and Amazon Redshift services.
Software Developer at Sisytec Networks
November 1, 2022 - October 16, 2025
Development of API, JWT authentications and user interfaces for web environments (PHP, Node.js, Angular, jQuery, CSS, SQL).

Education

Data Science Degree at Universitat Oberta de Catalunya
February 1, 2022 - September 1, 2026
Web Applications Development Associate Degree (CFGS) at IES Josep Pla
September 1, 2019 - December 1, 2021

Qualifications

Microsoft Azure AI-900
January 11, 2030 - October 16, 2025
Databricks Generative AI Fundamentals and Databricks Lakehouse Fundamentals
January 11, 2030 - October 16, 2025

Industry Experience

Telecommunications, Software & Internet, Professional Services

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
Intermediate
See more