Portfolio

Projects & Achievements

Things I've built and the research I've worked on — from code-switching speech recognition and deepfake detection to multi-agent LLM systems. Most have code on GitHub.

KX-Solution

2026

AutoNxt AI · Kaizenics

A domain-agnostic agentic AI platform — multi-agent orchestration that adapts across domains. Built at AutoNxt AI for Kaizenics, with a software-defined-vehicle application running virtual cars on RemotiveLabs ECUs with Android Automotive and CARLA.

PythonMulti-agentRemotiveLabsAndroid AutomotiveCARLANeo4jDocker

Baby Safety Reminder

2026

AutoNxt AI · digital.auto

A child-presence safety reminder that helps prevent children from being left unattended in a vehicle — built at AutoNxt AI with digital.auto.

digital.autoVSSPythonComputer Vision

Code-Switching Automatic Speech Recognition (Malay–English)

2025

Research @ A*STAR I²R, Singapore

A hybrid Whisper-encoder + LLaMA-decoder ASR system for Malay–English code-switching, trained with Noisy Student Training and teacher–student pseudo-labeling for low-resource robustness.

Reduced WER from 34% → 28% on the CS-ASR benchmark

PyTorchWhisperLLaMAHugging Face

Multi-Agent E-commerce Assistant

2025

A modular multi-agent system automating customer support, product recommendation, and dynamic pricing — RAG-based reasoning and memory over a streaming backend with a TypeScript front-end.

End-to-end prototype: ingestion → embeddings → agents → API → UI

LangChainAutoGenFastAPIKafkaQdrantTypeScript

CIViT — Deepfake Detection

2024

Research @ CIS Lab, National Chung Cheng University (Taiwan)

A hybrid CNN + InceptionNeXt + Vision Transformer pipeline for video deepfake detection, trained across Celeb-DF (V2), DFD, DeepfakeTIMIT and WildDeepfake with strong augmentation.

91% accuracy on CelebDF-V2, 94% on DFD

PyTorchVision TransformerInceptionNeXtOpenCV

Fashion Image Captioning (BEiT + GPT-2)

2024

A Transformer encoder–decoder that generates fashion product descriptions, using BEiT for visual features and GPT-2 for generation, evaluated on the FACAD dataset.

ROUGE-L 67.5 / BLEU-1 65.4 — beats ViT & DeiT baselines

PyTorchBEiTGPT-2Hugging Face

RAG for Financial Information

2024

A retrieval-augmented generation pipeline for question-answering over financial documents.

LangChainVector DBPython

Vietnamese Sentiment Classification

2023

Vietnamese sentiment classification experiments comparing different word-embedding approaches.

Pythonscikit-learnNLP

GraphFrames on Spark

2023

Large-scale graph analytics with GraphFrames on Apache Spark.

Apache SparkGraphFrames