Catapult Center Workshops:
Basic Skills in the Digital Humanities
Register today for this free workshop at:
April 12, 2013 (Friday)
10:00 am – 1:00 pm (with break)
Wells Library Information Commons Cluster 1
Independent of which research area we work in, most of us often have to search in texts. Sometimes, just searching for individual words, or sequences of them, is insufficient because the search results cover too many phenomena that are not interesting to us. This workshop will introduce more powerful methods for searching in text. One such method will be the use of regular expressions. This allows us to search for more general sequences, for example “as … as”. This type of search can be used in a wide variety of text processing applications.
However, we will also look at linguistic annotations, such as part-of-speech tags, syntax, and discourse annotation (in the form of frame annotation). Such annotations can help if we want to search for specific linguistic patterns. For example, if we want to search for occurrences of “as soon as”, “as long as”, etc., we can make the search more general and search for “as ADJ as”. However, this means, we need information about the parts of speech of words.
In the workshop, we will discuss which resources are available, how such annotations can be performed automatically, how we can search in such texts with linguistic annotations, what types of errors we have to expect from the automatic annotations, and what consequences this has for searching in the data.
No linguistic or programming experience necessary.