SupraLabs has released SupraSafety-18M, a BERT-style binary text classifier with 18 million parameters designed for content moderation on edge devices and mobile phones. The model was trained from scratch on the nvidia/Nemotron-3.5-Content-Safety-Dataset and achieves an accuracy of 81.2% and precision of 86.9%.
- Trained from scratch on 2 T4 GPUs in Kaggle for 7 epochs using the nvidia/Nemotron-3.5-Content-Safety-Dataset.
- Optimized for low-latency production environments, edge devices, and mobile phones.
- Classifies text as either SAFE or UNSAFE with high confidence levels in examples (e.g., 99.6% for unsafe bomb-making queries).
- Available on Hugging Face under the SupraLabs organization.
The model enables efficient content moderation capabilities in resource-constrained environments where running larger models is impractical.