Named entity learning and verification: EM in large corpora

The regularity of named entities is exploited to learn names and to extract named entities. Starting from only a few name elements and a set of patterns, the algorithm learns new names and their elements. A verification step ensures quality by using a large background corpus. Classifying the newly learnt elements at the character level yields further improvement. Unsupervised rule learning is also discussed.
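
The learn-and-verify loop summarized above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the two patterns (known first name followed by a capitalized word, and capitalized word followed by a known last name), the labels FN and LN, the thresholds min_ratio and min_hits, the function names, and the toy corpus are all assumptions made for illustration. Character-level classification and rule learning are not shown.

```python
def propose(sentences, first_names, last_names):
    """Search step: apply two toy patterns to adjacent capitalized tokens."""
    candidates = set()
    for tokens in sentences:
        for left, right in zip(tokens, tokens[1:]):
            if not (left[:1].isupper() and right[:1].isupper()):
                continue
            # Assumed pattern 1: known first name + unknown word -> last-name candidate
            if left in first_names and right not in last_names:
                candidates.add((right, "LN"))
            # Assumed pattern 2: unknown word + known last name -> first-name candidate
            if right in last_names and left not in first_names:
                candidates.add((left, "FN"))
    return candidates


def verify(word, label, background, first_names, last_names,
           min_ratio=0.5, min_hits=2):
    """Verification step: accept a candidate only if enough of its
    occurrences in the background corpus fit the expected context."""
    hits = total = 0
    for tokens in background:
        for i, tok in enumerate(tokens):
            if tok != word:
                continue
            total += 1
            if label == "LN" and i > 0 and tokens[i - 1] in first_names:
                hits += 1
            elif label == "FN" and i + 1 < len(tokens) and tokens[i + 1] in last_names:
                hits += 1
    if total == 0:
        return False
    return hits >= min_hits and hits / total >= min_ratio


def bootstrap(sentences, background, first_names, last_names, max_rounds=10):
    """Alternate proposing and verifying until no new element is accepted."""
    first_names, last_names = set(first_names), set(last_names)
    for _ in range(max_rounds):
        added = False
        for word, label in sorted(propose(sentences, first_names, last_names)):
            if verify(word, label, background, first_names, last_names):
                (first_names if label == "FN" else last_names).add(word)
                added = True
        if not added:
            break
    return first_names, last_names


# Toy corpus (invented); it serves as both the text and the background corpus.
corpus = [s.split() for s in [
    "Anna Meyer met Peter Meyer",
    "Anna Meyer and Peter Schmidt spoke",
    "Peter Meyer thanked Anna Meyer",
    "With Anna Berlin seemed smaller",
    "Anna Meyer visited Berlin",
    "Berlin is large",
    "Peter Schmidt left",
]]
first, last = bootstrap(corpus, corpus, first_names={"Anna"}, last_names=set())
```

Starting from the single seed "Anna", this sketch first accepts "Meyer", which in turn lets it accept "Peter" and then "Schmidt". "Berlin" is proposed once after "Anna" but rejected by the verification step, because most of its occurrences in the background corpus do not follow a known first name.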

Uwe Quasthoff and Christian Biemann, Named entity learning and verification: EM in large corpora. In: Dan Roth and Antal van den Bosch (eds.), Proceedings of CoNLL-2002, Taipei, Taiwan, 2002, pp. 8-14.
Last update: September 06, 2002.