Communal and Misogynistic Aggression in Hindi-English-Bangla (The ComMA Project)

Our project, in collaboration with Dr. Bhimrao Ambedkar University and UnReaL-TecE LLP, aims to understand the construction and evaluation of communal and sexually threatening misogynistic content on social media in India. We have developed a multilingual, multimodal dataset containing 57,363 annotated comments, 1,142 annotated memes, and around 70 hours of annotated audio, which has been collected from various social media platforms. Additionally, we have developed an automatic system that identifies communal and misogynistic aggression in four languages.

Project Docs and Details

The Project Website

Dataset and Models

Tagset and Annotation Guidelines:

Publications

Events and Training

Communal and Misogynistic Aggression in Hindi-English-Bangla (The ComMA Project)

Project Docs and Details

Team

Publications

Blind Spot

Events

podcasts

Share on Mastodon