Project team at CLiPS (University of Antwerp)

PhD student: Kim Luyckx
Promotor: Prof. dr. Walter Daelemans
Co-promotors: dr. Guy De Pauw and Edward Vanhoutte

Duration

January 2007 - end of December 2010

Abstract

Project funded by the Research Foundation Flanders (FWO)

In this project, we investigate a methodology for the automatic extraction and analysis of style. We intend to apply it both to individual authors (authorship attribution, for fiction and non-fiction) and to groups of authors (extraction of stylistic characteristics associated with gender and age). The methodology covers three aspects:

  1. Automatic linguistic analysis of documents with available text analysis tools at the level of morphological structure, part of speech, global syntactic structure and semantic roles (subject, object, temporal, location), in order to construct potentially relevant stylistic characteristics.
  2. Unsupervised and supervised learning techniques for selecting characteristics with high information value and constructing a model of authorial style.
  3. Evaluation of these models by (a) comparison with stylistic analyses in linguistics and literary studies and (b) empirical testing of the predictive power of the models.
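
The three aspects above can be illustrated with a deliberately small sketch. It is not the project's implementation: relative frequencies of a handful of function words stand in for the linguistic analysis, a nearest-centroid classifier stands in for the learning techniques, and leave-one-out testing stands in for the empirical evaluation. The word list, the toy corpus and all function names are assumptions made for illustration only.

```python
from collections import Counter
import math

# Hypothetical feature set: a few English function words.
# The project itself targets richer features (morphology, POS, syntax, roles).
FUNCTION_WORDS = ["the", "a", "of", "and", "to", "in", "i", "it", "but", "not"]

# Invented toy corpus of (author label, text) pairs.
TOY_CORPUS = [
    ("author_a", "the king of the land and the queen of the sea"),
    ("author_a", "the end of the road and the start of the day"),
    ("author_b", "i did not see it but i heard it"),
    ("author_b", "i know it but i can not say it"),
]


def features(text):
    """Aspect 1 (simplified): relative frequency of each function word."""
    tokens = text.lower().split()
    counts = Counter(tokens)
    total = len(tokens) or 1  # avoid division by zero on empty input
    return [counts[word] / total for word in FUNCTION_WORDS]


def centroid(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def distance(u, v):
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def train(labelled_texts):
    """Aspect 2 (simplified): one centroid per author as the style model."""
    by_author = {}
    for author, text in labelled_texts:
        by_author.setdefault(author, []).append(features(text))
    return {author: centroid(vecs) for author, vecs in by_author.items()}


def predict(model, text):
    """Attribute a text to the author whose centroid is closest."""
    vector = features(text)
    return min(model, key=lambda author: distance(model[author], vector))


def leave_one_out_accuracy(labelled_texts):
    """Aspect 3b (simplified): hold out each text in turn and predict it."""
    correct = 0
    for i, (author, text) in enumerate(labelled_texts):
        remaining = labelled_texts[:i] + labelled_texts[i + 1:]
        if predict(train(remaining), text) == author:
            correct += 1
    return correct / len(labelled_texts)


if __name__ == "__main__":
    model = train(TOY_CORPUS)
    print(predict(model, "the rise of the sun and the fall of the moon"))
    print(leave_one_out_accuracy(TOY_CORPUS))
```

On real data the held-out accuracy would be compared against a baseline, and the most discriminative features would be inspected for comparison with existing stylistic analyses (aspect 3a), which this sketch does not attempt.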

Expected results

Selected stylometry bibliography