The Institute
Publications
Projects
Software
CDEL 
Sound Lab
Resources at 
Indiana University
Opportunities at AISRI
Related Links
News and 
Events
Search the AISRI site!
Help

Projects

Projects | Research | Editorial | Educational

The Annotated Text Processor (ATP)

AISRI's Annotated Text Processor (ATP) is a text processor designed to manage interlinear text and to support the operations of several kinds of linguistic analysis including parsing and glossing.

Note: ATP is currently a beta release. To download the beta please contact aisri@indiana.edu
In addition, any technical support questions may also be directed to aisri@indiana.edu

Click here to take a tour of ATP's features and functions.


(click here to enlarge image)

Interlinear text analysis is a fundamental form of linguistic calculus in which a text is organized word by word and aligned into blocks with glosses or morphemic analyses listed across several lines. In the example above, the first line contains the phonemic words, the second parses the phonemic words into sequences of morphemes, the third gives the gloss of each constituent morpheme, and the fourth line provides a literal gloss of the phoneme with which it's aligned. ATP automatically reformats interlinear text as it is being entered. The sample text is a text from Douglas Parks' collection of South Band Pawnee texts being processed in ATP as part of his Northern Caddoan Languages Documentation Project.

ATP provides computer-aided glossing, parsing, and annotation of linguistic text data. On request ATP retrieves possible glosses or annotations for a given word from a collection of prior annotations in the same corpus. ATP's annotator interface (controls along the right edge) supports interactive glossing, annotation, and parsing operations during analysis and document construction work. The example here shows the result of a fuzzy search (highlighted in blue). A student working with a new language may find fuzzy matches useful in translation work.

ATP reads and searches IDD dictionary databases and is designed to write text examples into IDD dictionaries where the IDD user wishes to include them. ATP's media player (at bottom center) allows the user to work directly with sound and video data for transcription and analysis.

The document-processing interface displays the data model on which the current document is based (tree control at the upper left corner) and a map of the document as constructed (tree control at the lower left corner). ATP provides writing and editing tools found on most commercial word processors. The user has full control over the fonts used to display text and can take advantage of ATP's orthographic functions define customized sort orders and keyboards.

Acknowledgements: Development of ATP was supported by the National Science Foundation under Grant Nos. BCS-9875895, BCS-0215574, and BCS-0421838.

Back to Research Projects

© 2006, The Trustees of Indiana University