Contact us
Contact us

Datasets

Browse through our publications on language processing technology and social media trend detection.

Back to resources
custom_hero_background

Discover our datasets

We offer a small range of data management and analysis software applications, including digitising non-digital archives. We also offer tailored and standard training programs to expand your knowledge on what happens in the online world.

EN Toxic Word Embeddings

English word embeddings, trained on tendentious 4chan and 8chan data.

Request

NL Toxic Word Embeddings

Dutch word embeddings that capture tendentious language on for example GeenStijl.nl.

Request

NL Word Embeddings

Freely downloadable Dutch word embeddings, trained on massive amounts of data.

Download

Get in touch

Contact us to find out how we can help you.

Contact us!