#research

2 posts

asr nlp research

From 34% to 28% WER: lessons from code-switching ASR

What I learned building a Whisper + LLaMA speech recognizer for Malay–English code-switching — where the WER actually came from, and what didn't help.

Jun 18, 2026 · 4 min read

computer-vision research

Hybrid CNN + Vision Transformer for deepfake detection

Why combining CNNs, InceptionNeXt, and a Vision Transformer beat either alone for video deepfake detection — and why cross-dataset generalization is the metric that matters.

Jun 6, 2026 · 3 min read
