Available to hire
I am a hands-on AI/ML engineer with 9+ years building enterprise-grade AI systems, specializing in CCaaS, conversational AI, and Copilot-style assistants. I design scalable NLP/LMM-based pipelines, RAG solutions, and explainable AI to improve customer experience, compliance, and predictive automation.
I excel at translating business needs into robust ML/NLP platforms, deploying models as microservices, and leading end-to-end ML lifecycle workflows across cloud environments (AWS, Azure, GCP).
Experience Level
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Language
English
Fluent
Work Experience
Applied Scientist – Generative AI at Global Atlantic
February 1, 2025 - PresentDesigned and productionized GPT-4 powered RAG systems for real-time policy explanation, regulatory summarization, and advisor-assist copilots across life insurance and annuity domains. Built retrieval-augmented pipelines using LangChain, LlamaIndex, and FAISS; deployed on AWS SageMaker and Azure ML. Architected large-scale data processing with PySpark on Hadoop (YARN/HDFS/Hive) and established scalable AWS data pipelines (S3/EMR/Lambda). Prototyped cross-platform interfaces (Streamlit) and FastAPI-based backends for OCR-enabled summarization; automated infrastructure with Terraform and Cloud Composer. Explored RLHF-driven risk explanations to guide output preferences in regulated contexts.
AI Data Solutions Developer at Freedom Mortgage
January 1, 2025 - September 8, 2025Built Intelligent Document Processing for mortgage automation by fine-tuning domain-adapted BERT/RoBERTa/DistilBERT models for automated classification and PII detection. Implemented Kubernetes-based compute grids (Apache Ignite) for high-throughput inference, automated workflows with Power Automate, and Power BI dashboards for executives. Leveraged Databricks + Spark on Azure Synapse for analytics, and serverless classification via AWS Lambda/SageMaker. Established CI/CD with Azure DevOps and containerized microservices with Docker and Flask/FastAPI backends; implemented OCR+NLP pipelines and Terraform-based infrastructure provisioning.
Applied Scientist – NLP/ML at Virtusa Corporation (Fidelity – Boston, MA, US)
September 1, 2023 - September 8, 2025Fine-tuned BERT, RoBERTa, and DistilBERT for extractive and abstractive summarization of equity research and investment commentary. Built custom NER pipelines, PySpark-based NLP/ML services, and banking-focused AI solutions including sentiment analysis, fraud detection, and risk scoring. Integrated Flutter-based mobile interfaces, Azure AI Services with Power Platform, and Copilot Studio for conversational workflows. Containerized OCR/NLP microservices on Azure ML with CI/CD via Jenkins and GitHub Actions; enabled governance and reproducibility with MLflow and DVC.
Data Engineer – NLP Focus at Happiest Minds (App Annie – San Francisco, US)
May 1, 2021 - September 8, 2025Parsed semi-structured logs to support NLP classification pipelines. Built alert classifiers (TF-IDF, SVM, Logistic Regression) for incident triage; applied SpaCy NER and custom patterns to extract error codes and traces. Used LDA and BERTopic for topic modeling, developed token features, and implemented rule-based sentiment scoring. Automated ingestion of logs into NLP pipelines, integrated outputs with ticketing systems, and maintained CI/CD and containerized microservices. Worked in Linux/WSL environments with emphasis on reproducibility.
Associate Data Analyst at Fractal Analytics
August 1, 2018 - September 8, 2025Built Python pipelines (Pandas, NumPy) to process time-series metrics and structured logs for downstream ML modeling. Developed anomaly detection with scikit-learn and XGBoost, applied ARIMA and Prophet for forecasting, and implemented incident classification with Logistic Regression and Random Forest. Performed log pattern extraction via regex, BERTopic, and TF-IDF; connected MySQL/PostgreSQL for exploration; tuned models with grid search; produced visualizations with matplotlib/seaborn. Maintained reproducible workflows with Git and Linux/WSL.
Applied Scientist – Generative AI at Global Atlantic
February 1, 2025 - PresentDesigned and productionized GPT-4 powered RAG systems for real-time policy explanation, regulatory summarization, and advisor copilots across life insurance and annuity domains. Implemented retrieval-augmented generation pipelines using LangChain, LlamaIndex, and FAISS to ground responses over insurance compliance documents and product manuals. Deployed GPT-4 powered RAG systems on AWS SageMaker and Azure ML, delivering measurable improvements in compliance automation and policy summarization. Built scalable data pipelines (PySpark on Hadoop clusters with YARN/HDFS/Hive; AWS S3/EMR/Lambda) and developed FastAPI microservices for OCR-enabled document processing backends. Supported CI/CD with Azure DevOps and Docker. Prototyped RLHF-guided risk explanations and annotated token-level traces for outputs.
AI Data Solutions Developer at Freedom Mortgage
January 31, 2025 - September 8, 2025Led Intelligent Document Processing (IDP) for mortgage automation; fine-tuned domain-adapted BERT, RoBERTa, and DistilBERT for automated classification and summarization of mortgage documents. Built and deployed ML models via FastAPI/Flask; containerized on Kubernetes with Docker; automated CI/CD with Jenkins and GitHub Actions. Integrated with Dataverse, SQL, and SharePoint; built Power BI dashboards; automated AI-driven decision points; OCR-to-ML data pipelines. Leveraged AWS Lambda/SageMaker for serverless classification and deployed data pipelines on GCP (BigQuery, Dataflow). Implemented Terraform for infra provisioning and Cloud Composer workflows.
Applied Scientist – NLP/ML at Virtusa Corporation (Fidelity – Boston, MA, US)
September 30, 2023 - September 8, 2025Fine-tuned BERT, RoBERTa, and DistilBERT via Hugging Face Transformers for extractive and abstractive summarization of financial reports. Built custom NER pipelines with spaCy; engineered PySpark-based services for large-scale NLP workloads; integrated with mainframe-to-cloud data interfaces and CI/CD pipelines. Built Copilot Studio conversational forms and chatbots; containerized OCR/NLP microservices on Azure ML; automated ML lifecycle with MLflow and DVC; collaborated with research teams to productionize ML innovations. Deployed NLP services via Docker on Kubernetes; implemented explainability aids (SHAP, Integrated Gradients) and monitoring.
Data Engineer – NLP Focus at Happiest Minds
May 31, 2021 - September 8, 2025Parsed semi-structured logs for NLP classification; built alert classifiers using TF-IDF, SVM, and Logistic Regression for incident triage. Applied SpaCy NER and rule-based patterns for entity extraction; developed topic modeling with LDA and BERTopic; integrated outputs with ticketing systems. Built token features (n-grams, Word2Vec) for log similarity scoring and clustering; deployed Dockerized OCR/NLP microservices and automated CI/CD pipelines. Supported data processing with Databricks + Spark; designed data mining architectures for analytics-as-a-service.
Associate Data Analyst at Fractal Analytics
August 31, 2018 - September 8, 2025Built Python pipelines with Pandas/NumPy to process time-series metrics and structured logs for ML modeling. Developed anomaly detection models (scikit-learn, XGBoost) and time-series forecasts (ARIMA, Prophet). Implemented classification (Logistic Regression, Random Forest) for incident routing, and mined patterns from unstructured logs using BERTopic, TF-IDF, and regex. Connected MySQL/PostgreSQL databases to analytical notebooks and versioned notebooks/scripts with Git. Trained and evaluated models in Linux/WSL environments with CI-ready workflows.
Applied Scientist – Generative AI at Global Atlantic
February 1, 2025 - PresentDesigned and productionized GPT-4 powered RAG systems for real-time policy explanation, regulatory summarization, and advisor-assisted copilots across life insurance and annuity domains. Built retrieval-augmented generation pipelines using LangChain, LlamaIndex, and FAISS, integrating vector retrieval with GPT-4 for grounded responses over insurance compliance and product manuals. Deployed these systems on AWS SageMaker and Azure ML, delivering scalable AI capabilities. Developed PySpark jobs on Hadoop clusters handling multi-terabyte datasets; built distributed data pipelines on AWS (S3/EMR/Lambda); prototyped demo interfaces with Streamlit; explored RLHF techniques for risk-aware explanations.
AI Data Solutions Developer at Freedom Mortgage
January 31, 2025 - September 8, 2025Led Intelligent Document Processing (IDP) for mortgage automation: fine-tuned domain-adapted BERT/RoBERTa/DistilBERT for automated classification and summarization of mortgage documents; implemented OCR-to-ML pipelines; exposed APIs via FastAPI/Flask for document understanding; built compute grids on Kubernetes with Apache Ignite; automated CI/CD with GitHub Actions; created Power BI dashboards; leveraged Azure AI and AWS for scalable deployment; implemented Terraform-based infrastructure; conducted tokenizer and embedding validation with PyTest.
Applied Scientist – NLP/ML at Virtusa Corporation (Fidelity – Boston, MA, US), Mumbai
September 30, 2023 - September 8, 2025Fine-tuned BERT/RoBERTa/DistilBERT for extractive and abstractive summarization of equity research reports; built NER pipelines; engineered PySpark-based NLP services for large-scale workloads on AWS; developed banking AI solutions (sentiment, fraud, risk scoring); built Copilot Studio chatbots; integrated mainframe pipelines with cloud; dockerized OCR/NLP microservices; used DVC/MLflow for reproducibility; designed document retrieval and reasoning pipelines; implemented end-to-end ML lifecycle.
Data Engineer – NLP Focus at Happiest Minds (App Annie – San Francisco, US)
May 31, 2021 - September 8, 2025Processed semi-structured logs for NLP classification; built alert classifiers using TF-IDF, SVM, and Logistic Regression; applied SpaCy NER and rule-based patterns; engineered token features; used LDA/BERTopic for topic modeling; implemented rule-based sentiment scoring; trained classifiers with scikit-learn; integrated with Flutter mobile apps; deployed OCR/NLP microservices in Docker; automated CI/CD with Azure DevOps; built data pipelines with Databricks + Spark interfacing with Azure Synapse.
Associate Data Analyst at Fractal Analytics
August 31, 2018 - September 8, 2025Built Python pipelines to process time-series metrics and logs for ML modeling; developed anomaly detection models; applied ARIMA/Prophet forecasting; implemented IT incident classification; extracted patterns from unstructured logs with BERTopic/TF-IDF; connected MySQL/PostgreSQL; versioned notebooks with Git; built CI-ready workflows; supported system monitoring by modeling degradation signals.
Applied Scientist – Generative AI at Global Atlantic
February 1, 2025 - PresentDesigned and productionized GPT-4 powered RAG systems for real-time policy explanation, regulatory summarization, and advisor copilots across life insurance and annuity domains. Built retrieval-augmented generation pipelines using LangChain, LlamaIndex, and FAISS to ground responses over insurance compliance and product manuals. Deployed on AWS SageMaker and Azure ML to enable scalable, compliant inference. Developed PySpark data pipelines on Hadoop clusters for multi-terabyte datasets and integrated S3/EMR/Lambda for scalable ingestion. Built FastAPI-based backends exposing OCR and LLM-powered summarization services, and prototyped real-time demo interfaces with Streamlit. Implemented automated CI/CD with GitHub Actions and Terraform for reproducible deployments. Led evaluation and tracing with LangSmith and MLflow for token-level traceability and prompt testing.
AI Data Solutions Developer at Freedom Mortgage
January 1, 2025 - September 8, 2025AI Data Solutions Developer focused on Intelligent Document Processing for mortgage workflows. Fine-tuned domain-adapted BERT/RoBERTa/DistilBERT for automated classification and summarization of mortgage documents; implemented OCR + NLP pipelines; containerized microservices with FastAPI/Flask; built Kubernetes-based compute grid with Apache Ignite to reduce latency; automated end-to-end workflows with Power Automate; dashboards with Power BI; integrated AI Builder models; DevOps with Azure DevOps/Jenkins; Docker-based services; built data pipelines on AWS/Azure.
Applied Scientist – NLP/ML at Virtusa Corporation (Fidelity – Boston, MA, US)
September 1, 2023 - September 8, 2025Fine-tuned BERT, RoBERTa, and DistilBERT for extractive and abstractive summarization of equity research reports and investment commentaries. Built custom NER pipelines; engineered PySpark-based services for large-scale NLP workloads on AWS clusters; integrated with Dataverse and Azure AI services; contributed to Copilot Studio-style chatbots and policy advisory copilots. Containerized with Docker; tracked experiments with MLflow and DVC; established CI/CD for ML workflows and implemented data interfaces between mainframe and distributed/cloud systems.
Data Engineer – NLP Focus at Happiest Minds (App Annie – San Francisco, US)
May 1, 2021 - September 8, 2025Parsed semi-structured logs for NLP classification pipelines; built alert classifiers using TF-IDF, SVM, and Logistic Regression; applied SpaCy NER and rule-based patterns; executed topic modeling with LDA and BERTopic; implemented log similarity scoring and clustering; integrated NLP outputs with ticketing systems; versioned pipelines with Git; designed OCR/NLP pipelines for document ingestion and automation.
Associate Data Analyst at Fractal Analytics
August 1, 2018 - September 8, 2025Built Python pipelines with Pandas/NumPy to process time-series metrics and structured system logs for downstream ML modeling. Developed anomaly detection models using scikit-learn and XGBoost; applied ARIMA and Prophet for time-series forecasting; automated incident classification and ticket routing; extracted patterns from unstructured logs using regex, BERTopic, and TF-IDF; connected MySQL and PostgreSQL databases for exploratory analysis and reproducibility.
Applied Scientist – Generative AI at Global Atlantic
February 1, 2025 - PresentDesigned and productionized GPT-4 powered RAG systems for real-time policy explanation, regulatory summarization, and advisor copilots across life insurance and annuity domains. Built retrieval-augmented generation pipelines with LangChain, LlamaIndex, and FAISS; deployed on AWS SageMaker and Azure ML for scalable grounded responses. Implemented PySpark data processing on Hadoop clusters and developed scalable data pipelines on AWS (S3, EMR, Lambda). Built FastAPI-based OCR & NLP backends; prototyped RLHF-guided risk explanations to guide output preferences. Enabled end-to-end governance with MLflow/DVC and GitHub Actions for reproducible model lifecycles.
AI Data Solutions Developer at Freedom Mortgage
January 31, 2025 - September 8, 2025Built and deployed Intelligent Document Processing (IDP) for mortgage automation, fine-tuned domain-adapted BERT/RoBERTa/DistilBERT for document classification and PII detection, and operationalized NLP pipelines as APIs (FastAPI/Flask). Implemented Kubernetes compute grids with Apache Ignite; automated CI/CD with Docker, Jenkins, and cloud integration (Azure/AWS). Developed document intelligence pipelines (OCR + NLP) and Power Platform solutions for enterprise workflows; delivered scalable consented analytics.
Applied Scientist – NLP/ML at Virtusa Corporation (Fidelity – Boston, MA, US), Mumbai
September 1, 2023 - September 8, 2025Fine-tuned BERT/RoBERTa/DistilBERT for extractive/abstractive summarization of equity research; built NER pipelines; engineered PySpark services for large-scale NLP workloads; integrated Azure AI Services with Power Platform; contributed to Copilot Studio conversational forms; collaborated with business leaders to translate requirements into scalable AI solutions; supported cross-functional integration of mainframe and cloud systems.
Data Engineer – NLP Focus at Happiest Minds (App Annie – San Francisco, US), Bengaluru
May 1, 2021 - September 8, 2025Parsed semi-structured logs for NLP classification; built alert classifiers (TF-IDF, SVM, Logistic Regression); leveraged SpaCy NER and BERTopic/LDA for topic modeling; implemented log similarity and clustering; automated end-to-end workflows; integrated with Databricks/Spark in Azure Synapse; deployed OCR/NLP microservices with CI/CD.
Associate Data Analyst at Fractal Analytics
August 1, 2018 - September 8, 2025Built Python pipelines for time-series metrics and logs; developed anomaly detection models (scikit-learn XGBoost); applied ARIMA/Prophet for forecasting; performed log pattern mining with BERTopic/TF-IDF; connected MySQL and PostgreSQL; versioned notebooks with Git; implemented reproducible pipelines for incident analysis.
Applied Scientist – Generative AI at Global Atlantic
February 1, 2025 - PresentDesigned and productionized GPT-4 powered RAG systems for real-time policy explanation, regulatory summarization, and advisor copilots across life insurance and annuity domains. Built retrieval-augmented generation pipelines with LangChain, LlamaIndex, and FAISS; deployed on AWS SageMaker and Azure ML; designed data pipelines in PySpark on Hadoop; implemented OCR/NLP document processing and exposed APIs via FastAPI/Docker; automated CI/CD and infrastructure as code.
AI Data Solutions Developer at Freedom Mortgage
January 1, 2025 - September 8, 2025Built and operationalized scalable ML services for document understanding, PII detection, and loan decisioning; fine-tuned domain-adapted models (BERT/RoBERTa/DistilBERT) for automated classification and summarization; deployed NLP services as API microservices; implemented scalable data pipelines (AWS S3/EMR/Lambda) and low-code automation; led document intelligence workflows (OCR + NLP) and compliant data processing.
Applied Scientist – NLP/ML at Virtusa Corporation (Fidelity)
September 1, 2023 - September 8, 2025Fine-tuned BERT/RoBERTa/DistilBERT for extractive/abstractive summarization of equity research reports; built custom NER pipelines; engineered PySpark-based NLP services; integrated Azure AI services with Power Platform for semantic search and sentiment analysis; implemented Copilot Studio chatbots and low-code intelligent workflows.
Data Engineer – NLP Focus at Happiest Minds (App Annie)
May 1, 2021 - September 8, 2025Developed NLP-driven document processing and log analytics pipelines; parsed semi-structured logs, built alert classifiers with TF-IDF/SVM/Logistic Regression, applied NER for entity extraction, and performed topic modeling with LDA/BERTopic; dockerized microservices and orchestrated deployments on Kubernetes; designed data pipelines for analytics and compliance.
Associate Data Analyst at Fractal Analytics
August 1, 2018 - September 8, 2025Built Python data pipelines for time-series metrics and logs; developed anomaly detection with scikit-learn and XGBoost; applied ARIMA/Prophet for forecasting; implemented incident classification and log clustering; integrated with MySQL/PostgreSQL and versioned notebooks for reproducibility.
Education
Qualifications
Industry Experience
Financial Services, Software & Internet, Professional Services, Other, Computers & Electronics, Media & Entertainment
Experience Level
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Expert
Hire a Data Scientist
We have the best data scientist experts on Twine. Hire a data scientist in New York today.