I am a data engineer and analytics professional with 7+ years of experience designing data pipelines, analytical workflows, and cloud-based reporting systems. I specialize in Python, SQL, AWS, and time-series modeling to turn complex, unstructured data into reliable, production-ready datasets that support business decisions.\n\nI enjoy building reproducible data workflows, collaborating across teams, and sharing knowledge to accelerate adoption of BI and ML capabilities.

Hemanth Nagara j

I am a data engineer and analytics professional with 7+ years of experience designing data pipelines, analytical workflows, and cloud-based reporting systems. I specialize in Python, SQL, AWS, and time-series modeling to turn complex, unstructured data into reliable, production-ready datasets that support business decisions.\n\nI enjoy building reproducible data workflows, collaborating across teams, and sharing knowledge to accelerate adoption of BI and ML capabilities.

Available to hire

I am a data engineer and analytics professional with 7+ years of experience designing data pipelines, analytical workflows, and cloud-based reporting systems. I specialize in Python, SQL, AWS, and time-series modeling to turn complex, unstructured data into reliable, production-ready datasets that support business decisions.\n\nI enjoy building reproducible data workflows, collaborating across teams, and sharing knowledge to accelerate adoption of BI and ML capabilities.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert

Language

English
Fluent
French
Advanced
German
Intermediate

Work Experience

Senior Data Analyst at Makswel Pharmaceuticals
May 1, 2023 - Present
Designed and implemented a serverless data lakehouse architecture using AWS S3 and Athena, migrating legacy Excel workflows to centralized querying. Built Python-based reporting pipelines, improving data quality by 20% and automating KPI delivery via Power BI. Prepared regulatory & audit-compliant reports and automated KPI dashboards in Power BI, reducing manual reporting time by 50%. Cleaned and restructured raw datasets (sales, supply chain, product), improving data quality and ensuring schema consistency and documentation for compliance. Provided peer support and walkthroughs on BI tools, fostering knowledge sharing and adoption across the team.
Data Science Consultant at Aivariant
September 1, 2025 - Present
Delivered end-to-end analytical solutions with emphasis on data preparation, feature engineering, and reproducible modeling workflows. Built standardized pipelines covering data ingestion, cleaning, exploratory analysis (EDA), feature construction, and dataset validation to ensure consistency across use cases. Built reusable Python-based workflows to transform raw data into analysis-ready datasets, supporting downstream modeling and reporting. Benchmarked statistical, machine-learning, and deep-learning approaches with consistent evaluation metrics and backtesting strategies. Deployed data solutions for Financial Risk & Exposure Forecasting: time-series pipelines incorporating exogenous variables enabling 30-day forecasting using SARIMAX, XGBoost, and LSTM.
Entrepreneur in Residence at Antler
November 1, 2022 - April 30, 2023
As an Entrepreneur In Residence, participated in a cohort program to obtain €150,000 in pre-seed funding. Worked with co-founders to conceptualize ideas, research potential customers, gather feedback, develop a minimum viable product, and present pitches to investors. Prototyped NLP & Sentiment Analysis pipelines to extract automated insights from scientific publications for market validation.
Researcher at Michelin + Université Clermont Auvergne
September 1, 2018 - December 31, 2021
Designed and maintained scalable data processing pipelines for large-scale molecular simulation data generated on HPC clusters, using Python and Bash scripting. Planned and executed end-to-end computational workflows, balancing research objectives with industrial delivery constraints (timelines, reproducibility, performance). Processed and structured high-dimensional, GB-scale datasets, enabling efficient downstream analysis and reporting. Implemented automated data extraction and feature computation modules to quantify material properties across multiple spatial and temporal scales. Built reproducible data workflows to ensure data quality, validation, and traceability of simulation outputs. Collaborated cross-functionally with engineers, researchers, and industrial stakeholders to translate complex data outputs into actionable insights.

Education

PhD in Chemistry at Université Clermont Auvergne, France
January 1, 2018 - December 31, 2021
MSc in Materials Science at Technische Universität Freiberg, Germany
January 1, 2013 - December 31, 2017
Data Science Specialization Program at Steinbeis University, Germany
November 1, 2024 - October 1, 2025

Qualifications

Databricks: Data Engineer Associate
January 11, 2030 - January 29, 2026
SQL for Data Science (UC Davis)
January 11, 2030 - January 29, 2026
AI/BI for Data Analysts (Databricks)
January 11, 2030 - January 29, 2026
Data Modeling Strategies (Databricks)
January 11, 2030 - January 29, 2026
Data Science Workshop (Snowflake)
January 11, 2030 - January 29, 2026

Industry Experience

Healthcare, Life Sciences, Software & Internet, Professional Services