Content Moderation Testing Dashboard
This dashboard demonstrates a two-step content moderation process:
(1) First, content is checked against a comprehensive list of prohibited words based on the selected rating.
(2) If it passes the initial filter, GPT-4o-mini AI analysis is used to evaluate context, tone, and implicit meaning. This approach helps catch both explicit and subtle policy violations.
Loading moderation database...
Submit Content for Moderation
Enter text or a YouTube URL to test our content moderation system
You can use 'test-bad-word' as a failed badword
Submission History
Recent content moderation results
No submissions yet. Submit content to see results here.