I'm Vineeth Malyala, a seasoned AI/ML engineer and data scientist with 10+ years of experience designing, architecting, and delivering enterprise-grade AI, ML, and GenAI solutions across finance, healthcare, retail, and technology domains. I thrive on building robust, scalable systems that turn data into business impact, from predictive analytics to conversational AI and valuable ML-powered products. I specialize in GenAI & LLMs, prompt engineering, RAG pipelines, and production-grade ML platforms. I enjoy collaborating with clinicians, policy experts, product owners, and engineering teams to refine requirements, optimize performance, ensure governance and compliance, and deliver measurable improvements with clear documentation and governance.

Vineeth Malyala

I'm Vineeth Malyala, a seasoned AI/ML engineer and data scientist with 10+ years of experience designing, architecting, and delivering enterprise-grade AI, ML, and GenAI solutions across finance, healthcare, retail, and technology domains. I thrive on building robust, scalable systems that turn data into business impact, from predictive analytics to conversational AI and valuable ML-powered products. I specialize in GenAI & LLMs, prompt engineering, RAG pipelines, and production-grade ML platforms. I enjoy collaborating with clinicians, policy experts, product owners, and engineering teams to refine requirements, optimize performance, ensure governance and compliance, and deliver measurable improvements with clear documentation and governance.

Available to hire

I’m Vineeth Malyala, a seasoned AI/ML engineer and data scientist with 10+ years of experience designing, architecting, and delivering enterprise-grade AI, ML, and GenAI solutions across finance, healthcare, retail, and technology domains. I thrive on building robust, scalable systems that turn data into business impact, from predictive analytics to conversational AI and valuable ML-powered products.

I specialize in GenAI & LLMs, prompt engineering, RAG pipelines, and production-grade ML platforms. I enjoy collaborating with clinicians, policy experts, product owners, and engineering teams to refine requirements, optimize performance, ensure governance and compliance, and deliver measurable improvements with clear documentation and governance.

See more

Experience Level

Expert
Expert
Expert
Expert
Expert
Expert
Expert

Work Experience

Gen AI/Machine Learning Engineer at CVS Health
July 1, 2024 - Present
Designed an LLM-powered clinical summarization assistant using LangChain and AWS, cutting review time by 35%. Architected a HIPAA-compliant RAG platform with FastAPI microservices, vector search, PHI controls, and audit-ready logging. Built ingestion connectors for EHR, claims, and PDFs via APIs, S3, and SQL sources with incremental loads. Implemented transformation pipelines in PySpark/Databricks and pandas, standardizing codes, de-duping records, and enriching metadata. Modeled and stored curated datasets in Snowflake/PostgreSQL with partitioning and clustering for analytics and training. Integrated vector stores (FAISS/Pinecone) and metadata schemas to support semantic retrieval, filtering, and traceability. Selected domain-tuned transformer models including BERT and T5, plus frontier LLMs, to deliver summarization, classification, and conversational assistance with guardrails, safety controls, evaluation metrics, and governance. Implemented RAG, entity extraction, and risk scoring
Machine Learning/ Artificial Intelligence Engineer at Wells Fargo
April 1, 2022 - June 1, 2024
Delivered GenAI financial intelligence platforms using LLMs, RAG, Python, LangChain, and cloud-native ML stacks for fraud detection, compliance automation, and personalized banking journeys. Designed secure cloud-native microservices architectures integrating ML models, LLM-driven NLP services, RAG pipelines, ETL platforms, and analytics dashboards for scalable public-sector workloads. Led delivery in bi-weekly sprints, Agile/Scrum partnering with Product Owners, policy experts, and IT leadership to prioritize backlogs and modernization outcomes. Built ingestion pipelines connecting SQL Server, Hadoop ecosystems (HDFS, Hive, Pig), cloud storage, and enterprise systems to onboard structured and unstructured datasets. Implemented scalable transformations and feature engineering using PySpark, Apache Spark MLlib, Databricks, Pandas, and Airflow to prepare analytics, ML, and compliance reporting outputs. Built and maintained warehouses using Snowflake SnowSQL and executed migrations from o
Senior Data Scientist at LAUSD
August 1, 2019 - March 1, 2022
Built an AWS-native conversational Q&A and summarization platform using SageMaker, EMR (Spark), S3, OpenSearch, API Gateway, and Lambda for retirement and asset-management operations teams, cutting manual research time by ~35%. Designed retrieval-augmented, microservices-style architecture with a data lake, EMR/Glue ETL, SageMaker training/inference endpoints, OpenSearch hybrid search, and serverless APIs secured via IAM/KMS. Ingested data from REST APIs, retirement/plan administration systems, S3 document repositories, and SQL sources, engineering PySpark + Pandas/NumPy pipelines to cleanse, tokenize, and PII-mask policy text and transactional records. Implemented a semantic retrieval layer using Sentence-BERT embeddings, FAISS indexes, and OpenSearch hybrid search to improve policy lookups. Fine-tuned Transformer models (BERT, RoBERTa, T5, DistilBERT) for Q&A, classification, and summarization, orchestrating retrieval-augmented flows that wire FAISS/OpenSearch with model endpoints. D
Data Engineer / Machine Learning Engineer at Crocs
May 1, 2017 - July 1, 2019
Collaborated with cross-functional teams to design and implement ML solutions for retail operations, customer experience, and supply chain optimization. Built predictive models for sales forecasting, demand planning, and inventory optimization, achieving high accuracy. Developed customer segmentation, personalization, and recommendation models to improve engagement and conversion. Designed and deployed NLP models to analyze customer feedback, product reviews, and social media sentiment. Applied supervised, unsupervised, reinforcement learning, and deep learning techniques for pricing, promotions, logistics, churn analysis, and demand prediction. Built scalable ML pipelines using Python, R, and Spark to power analytics and recommendations, and created ETL workflows with SQL, Spark, and Airflow for end-to-end data workflows. Built dashboards with Tableau and Python visualizations to track KPIs and ROI. Mentored junior analysts and fostered best practices in ML, NLP, and data visualizatio
Data Analyst at Cipla
August 1, 2014 - January 1, 2017
Acted as a business consultant in healthcare analytics, leveraging data-driven insights to influence strategy, improve sales forecasting, and guide market expansion decisions across pharma operations. Used Google Cloud Platform (BigQuery) and TensorFlow for large-scale pharma dataset management, prescription analysis, and drug demand forecasting. Designed scalable ETL workflows with PySpark, Airflow, and SQL for ingestion and transformation of clinical and market datasets. Built dashboards and visualizations to monitor drug sales trends, track KPIs, and manage inventory levels. Developed data preprocessing pipelines to clean and validate sensitive pharma data, ensuring regulatory compliance and supporting client engagements with PoCs and technical demonstrations.

Education

Bachelor's in Computer Science at Avanthi Engineering College
August 1, 2010 - April 1, 2014

Qualifications

Add your qualifications or awards here.

Industry Experience

Healthcare, Financial Services, Education, Retail, Software & Internet