
Datasets

Browse through our publications on language processing technology and social media trend detection.


Discover our datasets

We offer a small range of software applications for data management and analysis, including tools for digitising non-digital archives. We also offer tailored and standard training programs to expand your knowledge of what happens online.

EN Toxic Word Embeddings

Freely downloadable English word embeddings, trained on tendentious 4chan and 8chan data.

Download

NL Toxic Word Embeddings

A Dutch word embedding model that captures toxic language as used on sites such as GeenStijl.nl.

Download

NL Word Embeddings

Freely downloadable Dutch word embeddings, trained on massive amounts of data.

Download

Want to discuss one of our products in more detail?

A short get-together is always more enlightening.

Would our products be right for you? How could they help you? Talk to one of our experts to find out.

Get in touch with us!