Tai Nguyen Phu - AI Engineer Intern

Tai Nguyen Phu

Exploring the Frontiers of AI and LLMs

View My Resume

About

👋 Hi, I'm Tai Nguyen Phu, a final-year Computer Science student passionate about Machine Learning, Natural Language Processing, and Large Language Models.
I'm currently pursuing my Bachelor's at the University of Information Technology (VNUHCM-Vietnam), focusing on advancing AI innovation and real-world applications.

SKILLS

Large Language Models: Transformers (Hugging Face), Sentence Transformers, LangChain, LLaMA, LoRA (PEFT), Prompt Engineering, RAG
LLM APIs: Groq API, OpenAI API
Vector Databases: Chroma, FAISS
Natural Language Processing: Hugging Face Tokenizers, NLTK, spaCy, Regex
Machine Learning: PyTorch, Scikit-learn, XGBoost
Computer Vision: OpenCV, YOLO
Databases: MySQL (basic), Firebase (basic)
Data Analysis: NumPy, Pandas, Polars (basic)
Big Data: Apache Spark (basic), Hadoop (basic)
Data Visualization: Matplotlib, Seaborn, Plotly
Languages: Python, C++, SQL (basic), HTML (basic)
Version Control: Git, GitHub, GitHub Actions
IDEs: VS Code, Jupyter Notebooks
Containerization: Docker
Cloud: AWS (basic)
MLOps: Weights & Biases, MLflow (basic)
Operating Systems: Ubuntu (basic), Windows
Languages: English (IELTS 7.0)
Teamwork: Presentation, Critical Thinking, Problem Solving

Projects

Undergraduate Projects - NLP Related

Vietnamese Legal Document Retrieval ⚖️

Semantic retrieval system for Vietnamese legal documents using Sentence Transformers, fine-tuned on custom data and evaluated with MTEB, supporting scalable vector search via FAISS.

Python, HuggingFace, SentenceTransformers, Gradio, Docker, Pandas, FAISS

Semantic Book Recommendation System 📚

Semantic book recommender using Sentence Transformers, zero-shot genre classification, and a Gradio UI, enhanced by EDA and sentiment analysis for personalized, label-free recommendations.

Python, Hugging Face Transformers, Gradio, Pandas, NumPy, Prompt Engineering

Job Application Assistant 💼

A GenAI assistant that automates cold emails and provides resume-tailored guidance via LangChain, FAISS-based retrieval, and a Streamlit interface for interactive job application support.

Python, Llama, LangChain, FAISS, Streamlit

Dense Passage Retrieval 📃

PyTorch reimplementation of the paper "Dense Passage Retrieval for Open-Domain Question Answering" (2020).

Python, PyTorch, FAISS, Transformers, NumPy, Pandas

Undergraduate Projects - CV Related

Scene Text Recognition 🎥

A scalable OCR system using YOLOv11m and CRNN with CTC loss, deployed via FastAPI + Ray Serve, supporting real-time OCR with GPU acceleration and an interactive Streamlit UI.

Python, PyTorch, YOLOv11m, CRNN, FastAPI, Ray Serve, Streamlit, OpenCV

Neural Style Transfer 🎨

A real-time neural style transfer application using feed-forward networks and OpenCV, deployed with Streamlit.

Python, PyTorch, VGG, TransformerNet, OpenCV, NumPy

Real-time Face Attendance System (Ongoing)

A real-time face recognition application using fine-tuned CNNs and OpenCV, deployed with Streamlit.

Python, PyTorch, CNNs, OpenCV, FAISS, NumPy, Streamlit

Undergraduate Projects - Data Science Related

Data Science Job Salary Prediction 📊

A salary prediction system for Data Science roles using XGBoost and Glassdoor data, featuring ML model comparison, hyperparameter tuning, and an interactive Streamlit app for real-time predictions.

Python, Scikit-learn, XGBoost, Pandas, NumPy, Matplotlib, Seaborn, Streamlit

Timeline