Available to hire
I’m Anushree Das, a ML Engineer and Data Scientist with 5+ years of experience building production ML systems, end-to-end data pipelines, and computer vision applications. I thrive on turning messy data into robust, scalable solutions and enjoy collaborating with cross-functional teams to deliver impact. I’m also an active researcher pursuing automated ML-driven data type inference for enterprise pipelines, and I hold an AWS Certified Cloud Practitioner credential. My work spans NLP, image classification, and similarity search across large-scale data platforms, with a focus on practical, production-ready solutions.
Skills
Language
Afar
Advanced
Amharic
Intermediate
Work Experience
Software Engineer at Arkatechture
April 1, 2024 - PresentEngineered an algorithm to automatically extract relational schemas from complex nested JSON, infer SQL data types, and create or update Snowflake tables without manual intervention, eliminating schema preparation overhead for new data sources. Designed and maintained scalable ETL ingestion pipelines for heterogeneous data sources via SFTP and APIs using Python, Snowflake, and AWS. Implemented data quality monitoring and validation across pipeline stages with error handling and alerting. Identified ML automation opportunities contributing to an active ACM research submission on automated data type inference.
Data Engineer at CloudData Technology LLC
November 1, 2023 - March 31, 2024Implemented ETL pipelines using Python, AWS, and Snowflake to process and structure large-scale datasets for downstream analytics and pattern discovery. Automated data workflows to improve scalability and reduce manual processing effort.
Software Development Engineer at Amazon Web Services
April 1, 2022 - July 31, 2023Developed a multimodal similarity search tool using image and text embeddings for content-based retrieval across millions of assets on the Computer Vision Data Platform team. Built a data distribution monitoring system to track dataset statistics, detect anomalies, and surface quality insights for ML model training workflows.
Software Engineer at Quantil Inc. / CDNetworks Group
August 1, 2021 - March 31, 2022Reduced HTTP request processing time by 20% on cache servers by migrating NGINX configuration to optimized C source code. Built a foundational NGINX module and proxy server through the full development lifecycle including version control, testing, and documentation.
Python Developer Intern at NCAIR, IIT Bombay
March 1, 2019 - July 31, 2019Researched X-ray image reconstruction algorithms using computer vision filters at a national aerospace research center. Implemented user authentication and 3D model construction features for a medical CT scan analysis platform, reducing analysis time by 10%.
Education
MS Data Analytics at McDaniel College, Westminster, MD
January 1, 2025 - May 12, 2026MS Computer Science at Rochester Institute of Technology, Rochester, NY
August 1, 2019 - December 1, 2021BS Information Technology at University of Mumbai
June 1, 2014 - May 1, 2017Qualifications
AWS Certified Cloud Practitioner
December 1, 2024 - May 12, 2026Industry Experience
Software & Internet, Professional Services, Media & Entertainment, Education, Healthcare
Skills
Hire a Data Scientist
We have the best data scientist experts on Twine. Hire a data scientist in Newark today.