Natural Language Processing Reading List

Please send me an ordered list of papers that you would like to present, and any dates that you absolutely cannot lead class.

If you don't like any of the papers below, feel free to suggest others.

If this constraint satisfaction process proves intractable, I will randomly assign students to topics/dates.

Chapter 3

Memory-Based Morphological Analysis, Antal van den Bosch and Walter Daelemans, Proceedings of ACL, 1999.

Chapter 6

Language and Task Independent Text Categorization with Simple Language Models, Fuchun Peng, Dale Schuurmans and Shaojun Wang, Proceedings of HLT-NAACL, 2003.

Chapter 8

Detecting Errors in Part-of-Speech Annotation , Markus Dickinson and Detmar Meurers, Proceedings of EACL, 2003.

Dialogue Act Tagging with Transformation-Based Learning, Ken Samuel, Sandra Carberry and K. Vijay-Shanker, Proceedings of ACL, 1998.

Chapter 9

Learning PP attachment for filtering prosodic phrasing, van Herwijnen, van den Bosch, Terken, and Marsi, Proceedings of EACL, 2003.

A Simple Pattern-matching Algorithm for Recovering Empty Nodes and their Antecedents, Mark Johnson, Proceedings of ACL, 2002.

Chapter 10

Shallow Parsing on the Basis of Words Only: A Case Study, Antal van den Bosch and Sabine Buchholz, Proceedings of ACL, 2002.

An Empirical Evaluation of Probabilistic Lexicalized Tree Insertion Grammars, Rebecca Hwa, Proceedings of ACL-COLING, 1998.

Question-Answering and Class Project

GATE: An Architecture for Development of Robust HLT Applications, Hamish Cunningham; Diana Maynard; Kalina Bontcheva; and Valentin Tablan, Proceedings of ACL, 2002.

Performance Issues and Error Analysis in an Open-Domain Question Answering System, Dan Moldovan; Marius Pasca; Sanda Harabagiu; and Mihai Surdeanu, Proceedings of ACL, 2002.
OR
Experiments with Open-Domain Textual Question-Answering, Harabagiu, Pasca and Maiorano, Proceeding of COLING, 2000.

Reading Comprehension Programs in a Statistical-Language-Processing Class, Eugene Charniak, Yasemin Altun, Rodrigo de Salvo Braz, et al., Proceedings ANLP/NAACL Workshop on Reading Comprehension Tests as Evaluation for Computer-Based Language Understanding Systems, 2000.
OR
A Rule-based Question Answering System for Reading Comprehension Tests, Ellen Riloff and Michael Thelen, Proceedings ANLP/NAACL Workshop on Reading Comprehension Tests as Evaluation for Computer-Based Language Understanding Systems, 2000.
OR
A Machine Learning Approach to Answering Questions for Reading Comprehension Tests, Ng, Hwee Tou, & Teo, Leong Hwee, & Kwan, Jennifer Lai Pheng, Proceedings of EMNLP/VLC-2000, 2000.
OR
Deep Read: A Reading Comprehension System, Lynette Hirschman and Marc Light and Eric Breck and John D. Burger, Proceedings of ACL, 1999.

Chapter 11

Example Selection for Bootstrapping Statistical Parsers , M. Steedman, R. Hwa, S. Clark, M. Osborne, A. Sarkar, J. Hockenmaier, P. Ruhlen, S. Baker, and J. Crim, Proceedings of HLT-NAACL, 2003.
OR
On minimizing training corpus for parser acquisition, Rebecca Hwa, Proceedings of the Fifth Computational Natural Language Learning Workshop, 2001.

Partial Parsing via Finite-State Cascades, Steven Abney, Journal of Natural Language Engineering, 2(4), 1996.

Chapter 14

COGEX: A Logic Prover for Question Answering, Dan Moldovan, Christine Clark, Sanda Harabagiu and Steve Maiorano, Proceedings of HLT-NAACL, 2003.

Chapter 15

Learning Extraction Patterns for Subjective Expressions, Ellen Riloff and Janyce Wiebe, Proceedings Conference on Empirical Methods in Natural Language Processing (EMNLP-03), 2003.

Information Extraction from Voicemail Transcripts, Martin Jansche and Steven Abney, Proceedings of EMNLP, 2002.

Chapter 16

Domain-transcending mappings in a system for metaphorical reasoning, John Barnden, Sheila Glasbey, Mark Lee, and Alan Wallington, Proceedings of EACL, 2003.

A General Feature Space for Automatic Verb Classification, Eric Joanis and Suzanne Stevenson, Proceedings of EACL, 2003.
OR
A Multilingual Paradigm for Automatic Verb Classification , Paola Merlo; Suzanne Stevenson; Vivian Tsang; and Gianluca Allaria, Proceedings of ACL, 2002.

The Necessity of Parsing for Predicate Argument Recognition, Daniel Gildea; Martha Palmer, Proceedings of ACL, 2002.
OR
The Descent of Hierarchy, and Selection in Relational Semantics, Barbara Rosario; Marti Hearst; Charles Fillmore, Proceedings of ACL, 2002.

Chapter 17

An Unsupervised Method for Word Sense Tagging using Parallel Corpora , Mona Diab and Philip Resnik, Proceedings of ACL, 2002.

Unsupervised Word Sense Disambiguation Rivaling Supervised Methods, David Yarowsky, Proceedings of ACL, 1995.

Chapter 18

An Unsupervised Approach to Recognizing Discourse Relations, Daniel Marcu and Abdessamad Echihabi, Proceedings of ACL, 2002.

Applying Co-Training to Reference Resolution, Christoph Mueller; Stefan Rapp; and Michael Strube, Proceedings of ACL, 2002.
OR
Resolving Pronominal Reference to Abstract Entities, Donna K. Byron, Proceedings of ACL, 2002.
OR
Improving Machine Learning Approaches to Coreference Resolution, Vincent Ng and Claire Cardie, Proceedings of ACL, 2002.
OR
Pronominalization in Generated Discourse and Dialogue, Charles B. Callaway; James L. Lester, Proceedings of ACL, 2002.

Chapter 19

A model of back-channel acknowledgements in spoken dialogue, Cathcart, Carletta, and Klein, Proceedings of EACL, 2003.

Predicting and Adapting to Poor Speech Recognition in a Spoken Dialogue System, Diane J. Litman and Shimei Pan, Proceedings of AAAI, 2000.
OR
Automatic Optimization of Dialogue Management, Diane J. Litman, Michael S. Kearns, Satinder Singh, and Marilyn A. Walker. Proceedings of COLING, 2000.