|
|
|
|
| · |
Kateherine Forbes-Riley, Diane Litman, Mihai Rotaru (2008) “Responding to Student Uncertainty during Computer Tutoring: A Preliminary Evaluation” In Proceedings of the 9th International Conference on Intelligent Tutoring Systems (ITS),Montreal, Canada, (to appear in June).
[abstract]
|
|
| |
This paper evaluates dialogue-based student performance in a controlled experiment using
versions of a tutoring system with and without automatic adaptation to the student affective state of uncertainty.
Our performance metrics include correctness, uncertainty, and learning impasse severities, which are
measured in a "test" dialogue after the tutoring treatment.
Although these metrics did not significantly differ across conditions when
considering all student answers in our test dialogue, we found significant differences in specific
types of student answers, and these differences suggest that our
uncertainty adaptation does have a positive benefit on student performance.
|
|
| · |
Greg Nicholas, Mihai Rotaru, Diane J. Litman (2008) “An Investigation of Using Word-level Features for Emotion Prediction”. Submitted to Speech Communication.
|
| · |
Katherine Forbes-Riley, Mihai Rotaru, Diane J. Litman (2008) “The Relative Impact of Student Affect on Performance Models in a Spoken Dialogue Tutoring System”. User Modeling and User Adapted Interaction (UMUAI) Special Issue on Affective Modeling and Adaptation, 18 (1-2).
[abstract]
|
|
| |
We hypothesize that student affect is a useful predictor of spoken
dialogue system performance, relative to other parameters. We test this hypothesis
in the context of our spoken dialogue tutoring system, where student learning is
the primary performance metric. We first present our tutoring system and corpora,
which have been annotated for student affect, correctness, and discourse structure.
We then discuss unigram and bigram parameters which we derive from these corpora.
The unigram parameters represent each annotation type individually, as well as
system-generic features. The bigram parameters represent annotation combinations,
including student state sequences and student states in the discourse structure
context. We then use these parameters to build learning models. First, we build
simple models based on correlations between each of our parameters and learning.
Our results suggest that student affect parameters are among our most useful predictors
of learning, particularly in specific discourse structure contexts. Next, we build
complex learning models, using the PARADISE framework, in which our parameters
are input to a multivariate linear regression, yielding a model containing only the
most useful subset of parameters. Our approach is a value-added one; we perform
a number of model-building experiments, both with and without including student
affect parameters, and then compare the performance of the models on the training
and the test sets. Our results show that when included as inputs, affect parameters
are selected as predictors in most models, and many of these models show high
generalizability in testing. Our results also show that overall, the affect-included
models significantly outperform the affect-excluded models.
|
|
|
|
|
| · |
Mihai Rotaru and Diane J. Litman (2007) “The Utility of a Graphical Representation of Discourse Structure in Spoken Dialogue Systems”. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL), Prague, Czech Republic.
[abstract]
|
|
| |
In this paper we explore the utility of the Navigation Map (NM), a graphical representation of the discourse structure. We run a user study to investigate if users perceive the NM as helpful in a tutoring spoken dialogue system. From the users' perspective, our results show that the NM presence allows them to better identify and follow the tutoring plan and to better integrate the instruction. It was also easier for users to concentrate and to learn from the system if the NM was present. Our preliminary analysis on objective metrics further strengthens these findings.
|
|
| · |
Katherine Forbes-Riley, Diane J. Litman, Amruta Purandare, Mihai Rotaru and Joel Tetreault (2007). “Comparing Linguistic Features for Modeling Learning in Computer Tutoring Dialogues”. In Proceedings of International Conference on Artificial Intelligence in Education (AIED 2007), Marina Del Rey, USA.
[abstract]
|
|
| |
We compare the relative utility of different automatically computable
linguistic feature sets for modeling student learning in computer dialogue tutoring.
We use the PARADISE framework (multiple linear regression) to build a learning
model from each of 6 linguistic feature sets: 1) surface features, 2) semantic
features, 3) pragmatic features, 4) discourse structure features, 5) local dialogue
context features, and 6) all feature sets combined. We hypothesize that although
more sophisticated linguistic features are harder to obtain, they will yield stronger
learning models. We train and test our models on 3 different train/test dataset pairs
derived from our 3 spoken dialogue tutoring system corpora. Our results show that
more sophisticated linguistic features usually perform better than either a baseline
model containing only pretest score or a model containing only surface features,
and that semantic features generalize better than other linguistic feature sets.
|
|
| · |
Katherine Forbes-Riley, Mihai Rotaru, Diane J. Litman, Joel Tetreault (2007) “Exploring Affect-Context Dependencies for Adaptive System Development”. In Proceedings of HLT/NAACL 2007 (late-breaking news award).
[abstract]
|
|
| |
We use the Chi Square test to investigate the context dependency of student affect in our computer tutoring dialogues, targeting uncertainty in student answers in 3 automatically monitorable contexts. Our results show significant dependencies between uncertain answers and specific contexts. Identification and analysis of these dependencies is our first step in developing an adaptive version of our dialogue system.
|
|
| · |
Mihai Rotaru (2007) “Applications of Discourse Structure for Spoken Dialogue Systems”. Ph.D. Thesis Proposal, Computer Science Department, University of Pittsburgh.
[abstract]
|
|
| |
Due to the relatively simple structure of dialogues in previous spoken dialogue systems, discourse structure has seen limited applications in these systems. We investigate the utility of discourse structure for spoken dialogue systems in complex domains (e.g. tutoring). Two types of applications are being pursued: on the system side and on the user side. On the system side, we investigate if the discourse structure information is useful for various spoken dialogue system tasks: performance analysis, characterization of user affect and characterization of speech recognition problems. On the user side, we investigate whether the discourse structure information is useful for users of a spoken dialogue system through a graphical representation of the discourse structure.
|
|
|
|
|
| · |
Mihai Rotaru, Diane Litman, Hua Ai, Kate Forbes-Riley, Gregory Nicholas, Amruta Purandare, Scott Silliman, Joel Tetreault, and Art Ward (2006) “Using Discourse Structure in Speech-based Computer Tutoring”. In Research Demo Session of IEEE/ACL Workshop on Spoken Language Technology (SLT). Aruba.
[abstract]
|
|
| |
This demonstration illustrates a graphical representation of the discourse structure in our speech-based computer tutor, ITSPOKE. Our system is a speech-enabled version of the Why2-Atlas qualitative physics intelligent tutoring system. We use the discourse structure hierarchy and a manual labeling of the discourse segment purpose to produce a graphical representation we call the Navigation Map. During the dialogue, the Navigation Map is automatically updated to reflect topics discussed in the past, the current topic and the next topic. A preliminary user study investigates whether users prefer the ITSPOKE version with the Navigation Map over the one without the Navigation Map.
|
|
| · |
Greg Nicholas, Mihai Rotaru, and Diane J. Litman (2006) “Exploiting Word-level Features for Emotion Prediction”. In Proceedings of IEEE/ACL Workshop on Spoken Language Technology (SLT). Aruba.
[abstract]
|
|
| |
In this paper we study two techniques for combining word-level features for emotion prediction. Prior research has primarily focused on the use of turn-level features as predictors. Recently, the utility of word-level features has been highlighted but only tested on relatively small human-computer corpora. We extend over previous work by investigating the strengths and weaknesses of two different techniques for using word-level features and by using a larger corpus of human-computer dialogue. Our results confirm that the word-level pitch features fare better than the turn-level ones regardless of the combination technique. In addition, we find that each word combination technique has different strengths and weaknesses in terms of precision and recall.
|
|
| · |
Mihai Rotaru and Diane J. Litman (2006) “Discourse Structure and Speech Recognition Problems”. In Proceedings of Interspeech 2006, Pittsburgh, USA (People's Choice Best Paper Award).
[abstract]
|
|
| |
We study dependencies between discourse structure and speech recognition problems (SRP) in a corpus of speech-based computer tutoring dialogues. This analysis can inform us whether there are places in the discourse structure prone to more SRP. We automatically extract the discourse structure by taking advantage of how the tutoring information is encoded in our system. To quantify the discourse structure, we extract two features for each system turn: depth of the turn in the discourse structure and the type of transition from the previous turn to the current turn. The Chi Square test is used to find significant dependencies. We find several interesting interactions which suggest that the discourse structure can play an important role in several dialogue related tasks: automatic detection of SRP and analyzing spoken dialogues systems with a large state space from limited amounts of available data.
|
|
| · |
Hua Ai, Diane Litman, Kate Forbes-Riley, Mihai Rotaru, Joel Tetreault, and Amruta Purandare (2006). “Using System and User Performance Features to Improve Emotion Detection in Spoken Tutoring Dialogs”. In Proceedings of Interspeech 2006, Pittsburgh, USA.
[abstract]
|
|
| |
In this study, we incorporate automatically obtained system/user performance features into machine learning experiments to detect student emotion in computer tutoring dialogs. Our results show a relative improvement of 2.7% on classification accuracy and 8.08% on Kappa over using standard lexical, prosodic, sequential, and identification features. This level of improvement is comparable to the performance improvement shown in previous studies by applying dialog acts or lexical-/prosodic-/discourse- level contextual features.
|
|
| · |
Mihai Rotaru and Diane J. Litman (2006) “Exploiting Discourse Structure for Spoken Dialogue Performance Analysis”. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Sydney, Australia.
[abstract]
|
|
| |
In this paper we study the utility of discourse structure for spoken dialogue performance modeling. We experiment with various ways of exploiting the discourse structure: in isolation, as context information for other factors (correctness and certainty) and through trajectories in the discourse structure hierarchy. Our correlation and PARADISE results show that, while the discourse structure is not useful in isolation, using the discourse structure as context information for other factors or via trajectories produces highly predictive parameters for performance analysis.
|
|
| · |
Mihai Rotaru and Diane J. Litman (2006) “Dependencies between Student State and Speech Recognition Problems in Spoken Tutoring Dialogues”. In Proceedings of the Joint Conference of the International Committee on Computational Linguistics and the Association for Computational Linguistics (Coling/ACL), Sydney, Australia.
[abstract]
|
|
| |
Speech recognition problems are a reality in current spoken dialogue systems. In order to better understand these phenomena, we study dependencies between speech recognition problems and several higher level dialogue factors that define our notion of student state: frustration/anger, certainty and correctness. We apply Chi Square analysis to a corpus of speech-based computer tutoring dialogues to discover these dependencies both within and across turns. Significant dependencies are combined to produce interesting insights regarding speech recognition problems and to propose new strategies for handling these problems. We also find that tutoring, as a new domain for speech applications, exhibits interesting tradeoffs and new factors to consider for spoken dialogue design.
|
|
|
|
|
| · |
Mihai Rotaru, Diane J. Litman, and Katherine Forbes-Riley (2005) “Interactions between Speech Recognition Problems and User Emotions”. In Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech-2005/Eurospeech), Lisbon, Portugal.
[abstract]
|
|
| |
Understanding how speech recognition problems affect the interaction with the user is a topic of great interest for the spoken dialogue community. In this paper, we examine the dependencies between speech recognition problems in adjacent turns. We also examine the dependencies between speech recognition problems and student emotions within a turn and in adjacent turns. We apply Chi Square analysis to a corpus of speech-based computer tutoring dialogues to discover these dependencies. We find that rejections are followed by more rejections than expected if there was no dependency between rejections, and that misrecognitions are followed by more misrecognitions than expected. We also find a strong dependency between recognition problems in the previous turn and user emotion in the current turn: after a system rejection there are more emotional user turns than expected. Surprisingly, in our data, we find no relationship between user emotions and recognition problems within a turn nor between previous turn user emotions and current turn recognition problems.
|
|
| · |
Mihai Rotaru and Diane J. Litman (2005) “Using Word-level Pitch Features to Better Predict Student Emotions during Spoken Tutoring Dialogues”. In Proceedings of the 9th European Conference on Speech Communication and Technology (Interspeech-2005/Eurospeech), Lisbon, Portugal.
[abstract]
|
|
| |
In this paper, we advocate for the usage of word-level pitch features for detecting user emotional states during spoken tutoring dialogues. Prior research has primarily focused on the use of turn-level features as predictors. We compute pitch features at the word level and resolve the problem of combining multiple features per turn using a word-level emotion model. Even under a very simple word-level emotion model, our results show an improvement in prediction using word-level features over using turn-level features. We find that the advantage of word-level features lies in a better prediction of longer turns.
|
|
| · |
Mihai Rotaru and Diane J. Litman (2005) “Improving Question Answering for Reading Comprehension Tests by Combining Multiple Systems”. In Proceedings of the American Association for Artificial Intelligence (AAAI) 2005 Workshop on Question Answering in Restricted Domains, Pittsburgh, PA.
[abstract]
|
|
| |
Most work on reading comprehension question answering
systems has focused on improving performance by adding
complex natural language processing (NLP) components to
such systems rather than by combining the output of
multiple systems. Our paper empirically evaluates whether
combining the outputs of seven such systems submitted as
the final projects for a graduate level class can improve over
the performance of any individual system. We present
several analyses of our combination experiments, including
performance bounds, impact of both tie-breaking methods
and ensemble size on performance, and an error analysis.
Our results, replicated using two different publicly available
reading test corpora, demonstrate the utility of system
combination via majority voting in our restricted domain
question answering task.
|
|
|
|
|
| · |
Mihai Rotaru and Diane J. Litman (2003) “Exceptionality and Natural Language Learning”. In Proceedings of the Conference on Computational Natural Language Learning (CoNNL) 2003, Edmonton, Canada.
[abstract]
|
|
| |
Previous work has argued that memory-based learning is better than abstraction-based learning for a set of language learning tasks. In this paper, we first attempt to generalize these results to a new set of language learning tasks from the area of spoken dialog systems and to a different abstraction-based learner. We then examine the utility of various exceptionality measures for predicting where one learner is better than the other. Our results show that generalization of previous results to our tasks is not so obvious and some of the exceptionality measures may be used to characterize the performance of our learners.
|
|
| · |
Viorel Negru, Calin Sandru and Mihai Rotaru (1999) “A Multiagent System for Scientific Problem Solving”. CEEMAS 1999, St. Petersburg, Russia.
|
|
|