MBSP is a set of linguistic tools based on the Timbl and Mbt memory based learning applications developed at CLiPS and ILK. It provides tools for Part of Speech tagging, Chunking, Lemmatizing, Relation Finding and (for medical language) Semantic tagging.
The general English version of MBSP has been trained on data from the Wall Street Journal corpus, the (bio-)medical English version was originally developed for use in the BioMint Text Mining tool and uses training data from the GENIA corpus.
Daelemans, W., Bucholz, S., and Veenstra, J. (1999) Memory-Based Shallow Parsing. Proceedings of CoNLL-99, Bergen, Norway, pp. 53-60.