Creating TwiSty: Corpus Development and Statistics

This document provides information on the creation of the Twitter Stylometry (TwiSty) corpus (Verhoeven et al., 2016). The corpus contains Twitter profiles annotated with MBTI personality types and gender information, covering six languages: Italian (IT), Dutch (NL), German (DE), Spanish (ES), French (FR), and Portuguese (PT).

Issue #: 
006
Author(s): 

Ben Verhoeven
Walter Daelemans
Barbara Plank

ISSN: 
2033-3544
Published: 
01/05/2016
AttachmentSize
ctrs-6351.09 KB
ctrs-006-front.jpg