Hi, I'm Yashika Sharma, a Senior Data Engineer with over five years of experience designing and building robust data pipelines, real-time processing systems, and scalable analytics platforms. I enjoy owning the entire data lifecycle from ingestion to insights, and lately I've been focusing on integrating LLM-powered search and AI agent-based systems into production with a strong emphasis on performance and clean architecture. I have a strong background in backend engineering, which helps me design cohesive and scalable data platforms that are built for production. I love working with new AI technologies and optimizing data workflows to help businesses make smarter decisions.

Yashika Sharma

Hi, I'm Yashika Sharma, a Senior Data Engineer with over five years of experience designing and building robust data pipelines, real-time processing systems, and scalable analytics platforms. I enjoy owning the entire data lifecycle from ingestion to insights, and lately I've been focusing on integrating LLM-powered search and AI agent-based systems into production with a strong emphasis on performance and clean architecture. I have a strong background in backend engineering, which helps me design cohesive and scalable data platforms that are built for production. I love working with new AI technologies and optimizing data workflows to help businesses make smarter decisions.

Available to hire

Hi, I’m Yashika Sharma, a Senior Data Engineer with over five years of experience designing and building robust data pipelines, real-time processing systems, and scalable analytics platforms. I enjoy owning the entire data lifecycle from ingestion to insights, and lately I’ve been focusing on integrating LLM-powered search and AI agent-based systems into production with a strong emphasis on performance and clean architecture.

I have a strong background in backend engineering, which helps me design cohesive and scalable data platforms that are built for production. I love working with new AI technologies and optimizing data workflows to help businesses make smarter decisions.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate

Language

English
Fluent

Work Experience

Senior Data Engineer at Nortal
August 1, 2023 - Present
Leading development of “Tarko,” an LLM-powered chatbot using Azure AI Foundry and multi-agent orchestration to automate user support and streamline information access. Built scalable batch and streaming ETL pipelines using S3, PySpark, Delta Lake, and Kafka for financial data, reducing latency and enabling near real-time insights via BigQuery and Looker. Deployed intelligent search on Azure with OpenSearch, AKS, and hybrid reranking, improving retrieval quality and system scalability. Tuned OpenSearch ingestion pipelines and joint indices for faster indexing and better performance in production. Automated data refresh workflows with Azure Logic Apps and supported Power BI reporting, reducing manual work and improving data timeliness.
Backend and Data Engineer at Alvin.ai
July 31, 2023 - July 26, 2025
Developed a data governance engine with manual and automated tagging, supporting business glossary integration across UI, CLI, and API layers. Built and released a CLI (PyPI) using Typer and FastAPI, used by top 3 enterprise customers to manage metadata, trigger dbt actions, and surface regression reports on PRs. Improved impact analysis and lineage computation, reducing execution time by 30% and enabling column-level visibility. Extended the automated data catalog with manual lineage features to improve support for custom pipelines. Created onboarding and admin tools to streamline customer setup, reducing handoff time for sales and support teams.
Software Engineer at Quine
September 30, 2021 - July 26, 2025
Built an automated pipeline with Django and PostgreSQL to capture and store daily log data with sub-second latency. Collected and normalized data from 95K+ blockchain projects across CoinMarketCap and V.Cent.co for internal analysis. Integrated Typeform API to aggregate multi-form responses into AWS RDS, improving data consolidation and accessibility. Developed an Auth0-based authentication system to capture user metadata and support a proof-of-concept for Git-linked NFTs.
Software Developer at Major League Hacking
August 31, 2020 - July 26, 2025
Contributed to 3 open-source ML/data science projects, now downloaded over 10M times globally. Published a BentoML deployment guide for SQL Server ML Services, referenced by 1.9K+ engineers. Added a clip feature to scikit-learn’s MinMaxScaler, improving control over transformed feature ranges. Authored an LSTM time series forecasting tutorial on keras.io simplifying weather prediction workflows.

Education

Bachelor of Technology at Guru Gobind Singh Indraprastha University, New Delhi, India
January 1, 2017 - July 1, 2021

Qualifications

Microsoft AZ-900
January 1, 2023 - December 31, 2023
Microsoft DP-203
January 1, 2023 - December 31, 2023
Snowflake COF-C02
January 1, 2023 - December 31, 2023

Industry Experience

Software & Internet, Financial Services, Professional Services

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Intermediate
Intermediate
Intermediate
Intermediate