Here is a list of older research projects and several class-related projects.


Ensemble Methods in Question Answering for Reading Comprehension Tests

We empirically evaluate whether combining the outputs of seven reading comprehension QA systems submitted as the final projects for a graduate level class can improve over the performance of any individual system. Our results, replicated using two different publicly available reading test corpora, demonstrate the utility of system combination via majority voting in our restricted domain question answering task.

[Publications]


Exceptionality and Natural Language Learning

Previous work has shown that when machine learning is applied to many natural language processing tasks, exceptional training examples play an important role in improving generalization accuracy. We are exploring whether such results generalize to spoken dialogue, and how different formalizations of "exceptionality" impact the performance of memory-based and rule-based learning algorithms.

[Publications]


Supporting user navigation in complex data spaces

We propose a graphical representation of the query history for users exploring a complex data space (e.g. real estate domain) in a multimodal dialogue system. The graphical representation enhances exploration by providing a structured view of the history and by predicting items of interest for the user. It can be used to guide tasks like query relaxation, summarization and query refinement and for information push.

This is my 2004 internship project at IBM T.J. Watson research center. For more details, contact my internship mentor, Shimei Pan.


User preference modeling

We automatically extract user preferences from user's interaction with a multimodal dialogue system in the real estate domain. We build a user preference model which is used to guide query relaxation in the system: whenever the user's query returns no items, a set of relevant items are proposed based on the preference model.

This is my 2003 internship project at IBM T.J. Watson research center. For more details, contact my internship mentor, Shimei Pan.


PCA, PPCA and PHITS comparison

This project compares the performance of 3 methods for finding representative components in a dataset: Principal Component Analysis (PCA), Probabilistic PCA (PPCA) and Probabilistic HITS (PHITS). Two type of data sets are used: a link data set (copublication data set from Auton Lab) and a microarray dataset (DNA microarray containing genes of Saccharomyces cerevisiae).

Class: Advanced Topics in Machine Learning
Project report


Compressing natural speech

This project explores an alternative compression technique for sound files containing human speech only (e.g. an interpretation of a text by a voice talent). The technique saves only the speech transcription (obtained manually) and the prosodic information (obtained automatically by running the Sphinx2 recognizer in forced alignment mode). The sound file is reconstructed from this information using a speech synthesized trained on the same voice. Results show a very good compression rate (1% of the MP3 version of the original file), a relatively good quality of the reconstructed sound file and good transfer to synthesizers built from other voices.

Class: Speech II - Phonetics, Prosody, Perception, Synthesis
Project report, Presentation


Command speech versus normal speech

This projects is a preliminary study of the differences between command speech and normal speech. Command speech is the speaking style used when uttering a command to a human or machine. A corpus of normal and command utterances was collected and analyzed.

Class: Speech II - Phonetics, Prosody, Perception, Synthesis
Project report, Presentation


Dialog Act Tagging using Memory Based Learning

We apply memory-based learning to predict the dialog act of user turns in the Switchboard corpus. Word bigrams are hashed to a fixed number of feature slots to reduce the number of features available to the machine learning algorithm. The TiMBL software package was used in the experiments.

Class: Advanced Topics in Artificial Intelligence (Dialog Systems)
Project report, Presentation