Available to hire
Hi, I’m Vegim Shala, a Data and Software Engineer with 5 years of experience specializing in big data solutions using Spark, SQL, Java, Python, and AWS. I enjoy leveraging my expertise to optimize data infrastructure and software systems, helping organizations become more efficient and data-driven.
Currently, I am pursuing a Master’s in Data Engineering and Analytics at the Technical University of Munich. I’m passionate about data analytics, machine learning, and cloud technologies, and I love mentoring teams and sharing knowledge to drive technical excellence.
Language
English: Fluent
German: Intermediate
Work Experience
Data Engineer at PRIME | Retail & Trade Solutions
February 1, 2022 - Present
Implemented an end-to-end data flow framework using Spark on Java and Python in a fail-safe, fault-tolerant manner with data quality checks, achieving 40% cost reduction and 6x runtime acceleration. Led development of the ETL repository, implementing 25+ features for a drag-and-drop data platform aimed at simplifying data team workflows. Rewrote inefficient ML algorithms into optimized Spark code, reducing runtime by 10x. Maintained AWS architecture and tuned EMR instance fleets for high resource utilization. Configured S3 buckets, used Athena, Glue, and dashboards to ensure data accuracy. Implemented data validation framework and dynamic alerting systems for timely issue notifications. Integrated large language models for product matching using vector databases. Co-created documentation and onboarding programs, mentoring and upskilling cross-functional teams.
Backend Engineer (Working Student) at Maltego
July 31, 2023 - July 11, 2025
Developed a simplified version of Maltego’s product aimed at users with varying technical backgrounds. Extracted unstructured data from multiple file types using Apache Tika, storing results in Elasticsearch. Created a NestJS-based framework for querying Elasticsearch.
Data Warehouse Intern → Junior Data Integration Specialist → Data Engineer at Raiffeisen Bank Kosovo
January 31, 2022 - July 11, 2025
Migrated Raiffeisen Bank Bulgaria’s Data Warehousing system from scattered IBM DataStage jobs and manual scripts to a fully automated Oracle Data Integrator system. Optimized complex data mappings via SQL best practices and Oracle hints, maintaining data integrity with constraints and lineage mechanisms. Reduced the most complex data loading from over 2 hours to under 10 minutes and cut overall DWH processing from 9 hours to 3 hours while adding new data layers. Automated daily data loads with file polling triggers. Managed Apache Airflow pipelines to improve data lifecycle management, quality, and reliability. Developed PySpark data products for client models with scalability enhancements like partition pruning. Introduced Change Data Capture (CDC), laying groundwork for Delta Lake migration.
Data and Software Engineer at PRIME | Retail & Trade Solutions
February 1, 2022 - Present
Implemented an end-to-end data flow framework using Spark on Java and Python with a custom orchestrator, achieving 40% cost reduction and 6x runtime acceleration. Led ETL repository development, driving ideation and implementing 25+ features for a drag-and-drop data platform. Optimized and scaled ML algorithms by rewriting Pandas code into efficient Spark code, resulting in 10x runtime reduction. Maintained and tuned AWS architecture including EMR and S3, achieving up to 85% resource utilization. Implemented data validation and dynamic alerting systems for data quality. Integrated large language models for product matching using vector databases. Co-created documentation and onboarding programs, mentoring cross-functional teams.
Backend Engineer (Working Student) at Maltego
July 31, 2023 - July 25, 2025
Created a simplified version of Maltego’s product for users with differing technical backgrounds. Extracted unstructured data from various file types using Apache Tika and stored it in Elasticsearch. Implemented a framework using NestJS to query Elasticsearch.
Data Warehouse Intern → Junior Data Integration Specialist → Data Engineer at Raiffeisen Bank Kosovo
January 31, 2022 - July 25, 2025
Migrated the data warehousing system from IBM DataStage to Oracle Data Integrator, automating advanced data processing tasks. Optimized complex data mappings with SQL best practices and Oracle hints while ensuring data integrity with constraints and lineage. Reduced data loading times from over 2 hours to under 10 minutes, cutting overall processing time from 9 to 3 hours. Automated daily data loads with file polling triggers. Orchestrated robust data pipelines with Apache Airflow, enhancing management and quality. Developed and deployed scalable data products using PySpark. Introduced Change Data Capture (CDC) as groundwork for Delta Lake migration.
Education
Bachelor's Degree at University of Prishtina, Faculty of Mathematics and Natural Sciences, Department of Mathematics
October 1, 2017 - December 31, 2020
Qualifications
Industry Experience
Software & Internet, Financial Services, Retail, Professional Services