This preview shows page 1. Sign up to view the full content.
Unformatted text preview: , etc. for each term.
Text chunking aims at grouping adjacent words in a sentence. An example of a chunk is the noun phrase “the current account deﬁcit”.
Word Sense Disambiguation (WSD) tries to resolve the ambiguity in the meaning of single words or phrases. An example is ‘bank’ which may have – among
others – the senses ‘ﬁnancial institution’ or the ‘border of a river or lake’.
Thus, instead of terms the speciﬁc meanings could be stored in the vector
space representation. This leads to a bigger dictionary but considers the
semantic of a term in the representation.
Parsing produces a full parse tree of a sentence. From the parse, we can ﬁnd the relation of each word in the sentence to all the others, and typically
also its function in the sentence (e.g. subject, object, etc.).
Linguistic processing either uses lexica and other resources as well as handcrafted rules. If a set of examples is available machine learning methods as
described in section 3, especially in section 3.3, may be employed to learn the
It turned out, however, that for many text mining tasks li...
View Full Document
This note was uploaded on 06/19/2011 for the course IT 2258 taught by Professor Aymenali during the Summer '11 term at Abu Dhabi University.
- Summer '11