About Me

Hi, I’m Vuthtyra (Teera) Yong, a recent graduate of Syracuse University with an M.S. in Applied Data Science. I’m seeking roles in data science, data analytics, data engineering, and related fields, with a strong interest in building data-driven and AI-powered solutions to real-world problems.

My experience includes projects in machine learning, natural language processing, multimodal retrieval, and data engineering, using tools such as Python, SQL, PyTorch, FastAPI, Azure, and AWS. I enjoy working across the full pipeline, from data preparation and modeling to deployment and user-facing applications. Most of all, I’m motivated by turning complex data into practical insights and systems that create meaningful impact.

Work Experience

Syracuse University

Syracuse, New York

AI Developer

April 2025 – April 2026

  • Built a full-stack AI research alert system using FastAPI, React/Vite, Azure Functions, Cosmos DB, Azure AI Search, Azure OpenAI, and Azure Communication Services Email.
  • Developed an automated recommendation pipeline that ingests arXiv preprints, enriches researcher profiles with ORCID/OpenAlex data, generates embeddings, performs vector-based matching, and sends daily email digests.
  • Designed persistence, deduplication, alert logging, and a 24-hour cleanup of temporary vectors to reduce duplicate recommendations and control Azure AI Search storage growth.

DENSO (Cambodia) Co., Ltd.

Phnom Penh, Cambodia

Machine Learning Engineer (Internship)

May 2025 – August 2025

  • Fine-tuned pretrained YOLO and Faster R-CNN models on manufacturing image datasets for automated defect detection and quality inspection.
  • Benchmarked model performance using precision, recall, mAP, and inference speed to identify the most practical solution for real-time inspection workflows.
  • Prototyped a containerized inference service using Docker and FastAPI to simulate real-time automated inspection.

Projects

Certifications and Badges

Skills

Python | SQL | R | C++ | Machine Learning & AI | Deep Learning | PyTorch | TensorFlow | Hugging Face | LangChain | RAG | Data Engineering | Apache Spark | Kafka | MongoDB | Elasticsearch | Neo4j | dbt | Data Visualization | Tableau | Power BI | Excel | Flask | FastAPI | React | Azure Cloud Technologies | AWS Cloud Technologies | Snowflake | Docker & Containers | Kubernetes | Git & GitHub | Linux