Priyanka Marihal

Available to hire

I’m Priyanka Marihal, a Data Scientist with 6+ years of experience in machine learning, big data analytics, and production-grade data solutions. I specialize in Python, PySpark, Azure Databricks, MLflow, Delta Lake, and Generative AI including Retrieval-Augmented Generation (RAG) and LLM integration.

I’ve built predictive models and AI applications that improve operational efficiency and customer experience across industries, including automotive and manufacturing. I’m based in Germany with a valid work permit.

Experience Level

Expert (all 7 listed skills)

Language

English: Fluent
Hindi: Fluent
Kannada: Fluent
German: Beginner

Work Experience

AI/ML Computational Science Senior Analyst at Accenture
September 1, 2021 - Present
Led development of the TalentMatch RAG Assistant for LLM-powered candidate-to-job matching. Designed the end-to-end embedding and indexing pipeline, storing embeddings in Databricks Vector Search for semantic retrieval, and implemented similarity search and prompt orchestration to generate explainable recommendations (skills, gaps, match score) that accelerate recruiter decisions. Also contributed to CNH Industrial/UCR Analytics: built predictive models on vehicle telemetry to reduce downtime, optimized Spark jobs with Delta Lake caching and partition pruning to cut cloud costs by ~25% and runtime by ~30%, and supported data governance migrations to Unity Catalog. Worked on the Metaverse Chatbot as Senior Developer, implementing NLP-based chat flows and integrating them with GCP services.
Consultant at Capgemini
November 1, 2019 - September 1, 2021
Defined the hotspot detection approach for Michelin’s large-scale data. Read and processed Parquet data to extract features, performed preprocessing and quality checks, and trained Isolation Forest and clustering models for anomaly/hotspot detection. Validated model performance and prepared the models for production deployment.

Education

Master of Science (Computer Science) at Visvesvaraya Technological University, India
August 1, 2013 - June 1, 2015
Bachelor of Science (Computer Science) at Visvesvaraya Technological University, India
June 1, 2009 - June 1, 2013

Qualifications

Generative AI Engineering with Databricks
January 11, 2030 - February 5, 2026

Industry Experience

Professional Services, Software & Internet, Manufacturing