Week 5: Text Mining
The lecture and seminar will be led by Chris Huyck.
Text Mining is Data Mining with natural Language. Natural language
is the kind of languages people speak like English, French, or Urdu; this
is opposed to formal languages which include HTML, C++, and ASCII. There
are a number of techniques that can be used to help understand
natural language, and make programs that extract information from
natural language.
Church and Rau's ACM paper. Additionally,
James Allen's Natural Language Understanding Book is a great
introduction to the theory and practice of text mining. It's one
of the recommended readings.