Content Moderation Testing Dashboard

This dashboard demonstrates a two-step content moderation process:
(1) Content is first checked against a list of prohibited words for the selected rating.
(2) If it passes that filter, GPT-4o-mini analyzes its context, tone, and implicit meaning.
Together, the two steps catch both explicit and subtle policy violations.
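
The two-step flow described above can be sketched roughly as follows. This is a minimal illustration in Python, not the dashboard's actual implementation: the rating names, the `PROHIBITED_WORDS` table, the `ModerationResult` type, and the `ai_check` callable are all hypothetical. The `ai_check` parameter stands in for the GPT-4o-mini call, which is not shown here.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, Optional, Set, Tuple

# Hypothetical per-rating word lists; the real moderation database is not shown.
PROHIBITED_WORDS: Dict[str, Set[str]] = {
    "G": {"test-bad-word", "another-placeholder"},
    "PG-13": {"test-bad-word"},
}


@dataclass
class ModerationResult:
    approved: bool
    stage: str  # "word_filter" or "ai_analysis": the step that made the decision
    reason: Optional[str] = None


def find_prohibited_word(text: str, rating: str) -> Optional[str]:
    """Step 1: return a prohibited word found in the text, or None."""
    if rating not in PROHIBITED_WORDS:
        raise ValueError(f"unknown rating: {rating}")
    # Lowercase and split into tokens; hyphens are kept so that
    # 'test-bad-word' is matched as a single token.
    tokens = set(re.findall(r"[a-z0-9-]+", text.lower()))
    hits = tokens & PROHIBITED_WORDS[rating]
    return min(hits) if hits else None


def moderate(
    text: str,
    rating: str,
    ai_check: Callable[[str, str], Tuple[bool, str]],
) -> ModerationResult:
    """Run the word filter first; call the AI step only if the filter passes."""
    hit = find_prohibited_word(text, rating)
    if hit is not None:
        return ModerationResult(False, "word_filter", f"prohibited word: {hit}")
    # Step 2: ai_check is an assumed stand-in for the model call. It returns
    # (approved, reason) after judging context, tone, and implicit meaning.
    approved, reason = ai_check(text, rating)
    return ModerationResult(approved, "ai_analysis", None if approved else reason)
```

Passing the AI step in as a callable keeps the sketch runnable without network access, and it makes the ordering explicit: the model is only consulted for content that the word filter did not already reject.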


Submit Content for Moderation

Enter text or a YouTube URL to test our content moderation system

To see a failed check, enter 'test-bad-word', which the prohibited-word filter rejects.

Submission History