Data Science Projects & Blogs
A collection of my Data Science projects and writings, covering topics such as machine learning, deep learning, causal inference, and data analysis.
-
Language Identification Model Evaluation - Part 02 - Replicate WordLlama Detect Model
Learning about the
WorldLamma Detectmodel and attempt to replicate its architecture by experimenting with alternative embedding backbones to improve performance -
Language Identification Model Evaluation – Part 01
A project that create a benchmark to evaluate multiple language identification models
-
Thesis - Replication study: Invariant model for causal transfer learning
My thesis
-
Kaggle Competition: Eedi – Mining Misconceptions in Mathematics
Math misconception prediction using embedding-based retrieval and LLM reranking — no model training required.
-
Stock Market Dashboard with Integrated Chatbot
A project that analyzes the stock market trends and integrates a chatbot to provide newcomers with insightful analysis and information about investing.
-
Kaggle Competition: Learning Agency Lab - Automated Essay Scoring 2.0
Our team developed a machine learning model to automatically score essays based on the Holistic Scoring Rubric.
-
Origami Style Prediction
A project to classify origami styles and identify which origamists’ style is most similar to my folding technique.
-
2022 Data Science and Machine Learning State Analysis
A project to explored a Kaggle dataset to get insights about people who is working with data in different industries.
-
House Price Prediction
A project to discorver the real estate’s state and predict the price of apartment in Ho Chi Minh City.