Facialabuse-gaia-3
Key advertised features:
| Component | Details |
|-----------|---------|
| | ViT‑L/14 pre‑trained on ImageNet‑21k, fine‑tuned on a curated “GAIA‑3 Abuse Corpus” (≈ 1.2 M images, 250 k video clips). |
| Temporal Module | 3‑layer TCN (kernel = 3, dilation = 2ⁿ) for 5‑frame sliding windows. |
| Prompt Encoder | Small BERT‑base model that maps textual prompts (e.g., “detect deepfakes where the subject is a minor”) into a shared embedding space. |
| Losses | Multi‑label binary cross‑entropy + a contrastive loss encouraging separation between abuse and benign “face‑only” samples. |
| Data Augmentation | Random cropping, color jitter, synthetic deep‑fake generation (using FaceSwap, DeepFaceLab) to balance minority abuse sub‑classes. |
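
The Temporal Module row gives only a kernel size, a layer count, and a dilation rule, so the arithmetic behind it may be worth spelling out. The following is a minimal sketch in plain Python, not the project's implementation. It assumes that n in "dilation = 2ⁿ" is the zero-based layer index, that the convolutions are stride-1 and causal (left zero-padded), and that one scalar feature per frame is enough for illustration. The function names `dilations`, `receptive_field`, and `dilated_conv1d` are hypothetical.

```python
def dilations(num_layers):
    """Dilation 2**n for layer n = 0 .. num_layers - 1 (assumed indexing)."""
    return [2 ** n for n in range(num_layers)]


def receptive_field(kernel_size, dils):
    """Input frames that can influence one output step of a stack of
    stride-1 dilated convolutions."""
    return 1 + sum((kernel_size - 1) * d for d in dils)


def dilated_conv1d(x, weights, dilation):
    """Causal dilated 1-D convolution over a list of per-frame scalars.

    Taps that would fall before the first frame are treated as zero,
    so the output has the same length as the input (assumed padding).
    """
    k = len(weights)
    out = []
    for t in range(len(x)):
        acc = 0.0
        for j, w in enumerate(weights):
            idx = t - (k - 1 - j) * dilation  # frame read by tap j
            if idx >= 0:
                acc += w * x[idx]
        out.append(acc)
    return out


if __name__ == "__main__":
    dils = dilations(3)                      # three layers, as in the table
    print(dils, receptive_field(3, dils))
    window = [1.0, 2.0, 3.0, 4.0, 5.0]       # one 5-frame sliding window
    print(dilated_conv1d(window, [1.0, 1.0, 1.0], dilation=2))
```

Under these assumptions the three layers use dilations 1, 2, and 4, and the stack could in principle see 15 frames. With a 5-frame window, most of that span would fall on padding, so the effective temporal context is bounded by the window length.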

The model is distributed under a non‑commercial license and is hosted on a public GitHub repository with accompanying Docker images, a Python SDK, and a web‑demo UI.