Researchers from the Queensland University of Technology (QUT) in Australia have developed an algorithm that detects misogynistic content on Twitter.
The team developed the system by first mining 1 million tweets. They then refined the dataset by searching the posts for three abusive keywords: whore, slut, and rape.
Next, they categorized the remaining 5,000 tweets as either misogynistic or not, based on their context and intent. These labeled tweets were then fed to a machine learning classifier, which used the samples to create its own classification model.
The system uses a deep learning algorithm to adjust its knowledge of terminology as language evolves. While the AI built up its vocabulary, the researchers monitored the context and intent of the language, to help the algorithm differentiate between abuse, sarcasm, and âfriendly use of aggressive terminology.â
âTake the phrase âget back to the kitchenâ as an example â devoid of context of structural inequality, a machineâs literal interpretation could miss the misogynistic meaning,â said Professor Richi Naya, a co-author of the study.
âBut seen with the understanding of what constitutes abusive or misogynistic language, it can be identified as a misogynistic tweet.â
[Read: ]
Nayak said this enabled the system to understand different contexts just by analyzing text, and without the help of tone.
We were very happy when our algorithm identified âgo back to the kitchenâ as misogynistic â it demonstrated that the context learning works.
The researchers say the model identifies misogynistic tweets with 75% accuracy. It could also be adjusted to spot racism, homophobia, or abuse of disabled people.
The team now wants social media platforms to develop their research into an abuse detection tool.
âAt the moment, the onus is on the user to report abuse they receive,â said Naya. âWe hope our machine-learning solution can be adopted by social media platforms to automatically identify and report this content to protect women and other user groups online.â
You can read the research paper on the Springer database of academic journals.
So you like our media brand Neural? You should join our Neural event track at TNW2020, where youâll hear how artificial intelligence is transforming industries and businesses.
Get the TNW newsletter
Get the most important tech news in your inbox each week.