Here you’ll find a collection of publications, blogs, data sets and other useful content


Publications co-authored by Textgain team members. Some of these are available for download from this site.

Online hatred of women in the forum – Linguistic analysis and automatic detection

Automatic detection of cyberbullying in social media text

Summary Authors Summary While social media offer great communication opportunities, they also increase the vulnerability of young people to threatening situations online. Recent studies report that cyberbullying constitutes a growing problem among youngsters. Successful prevention depends on the adequate detection … Read More

Text-Based Age and Gender Prediction for Online Safety Monitoring

Automatic Detection of Online Jihadist Hate Speech

Summary Author Summary We have developed a system that automatically detects online jihadist hate speech with over 80% accuracy, by using techniques from Natural Language Processing and Machine Learning. The system is trained on a corpus of 45,000 subversive Twitter … Read More

Multilingual Cross-domain Perspectives on Online Hate Speech


True to our roots, we have several open-source libraries available on Github.


Arabic Dialect Identification


GDPR Anonymization Tool


Data sets

Whenever possible, we like to make our data sets available. This is not always possible due to GDPR restrictions, but we share whatever we can.

4chan & 8chan Word Embeddings

Dutch Word Embeddings

Our latest blog posts

‘Taal is de sleutel tot echte artificiële intelligentie’

Featured Post

Textgain featured in “Benchmark studie over artificiële intelligentie”

PWC published a study of Artificial Intelligence vendors in Flanders. Textgain is also featured in this overview as a spin-off of the University of Antwerp. The report can be downloaded here.

Featured Post

Textgain featured in “Text Analytics APIs 2018”

Textgain is featured in Text Analytics APIs 2018: A Consumer Guide, a 300 page report on the state-of-the-art in Text Analytics APis. Robert Dale of the Language Technology Group has compiled a comprehensive report on currently available technologies. A free … Read More

Featured Post
Create a free account or contact sales