Feature selection for a rich HPSG grammar using decision trees

This paper examines feature selection for log-linear models over rich constraint-based grammar (HPSG) representations by building decision trees over features in corresponding probabilistic context-free grammars (PCFGs). We show that single decision trees do not make optimal use of the available information; ensembles of decision trees constructed over different feature subspaces show significant performance gains (a 14% reduction in parse selection error). We compare the performance of the learned PCFGs and log-linear models over the same features.

Kristina Toutanova and Christopher Manning, Feature selection for a rich HPSG grammar using decision trees. In: Dan Roth and Antal van den Bosch (eds.), Proceedings of CoNLL-2002, Taipei, Taiwan, 2002, pp. 77-83.

Last update: September 07, 2002. erikt@uia.ua.ac.be