computer-visionresearch
Hybrid CNN + Vision Transformer for deepfake detection
Why combining CNNs, InceptionNeXt, and a Vision Transformer beat either alone for video deepfake detection — and why cross-dataset generalization is the metric that matters.