EMNLP 2008: conference on Empirical Methods in Natural Language Processing — October 25-27, 2008 — Waikiki, Honolulu, Hawaii.

Welcome to EMNLP 2008

List of accepted papers

We received 385 submissions (not counting papers that were withdrawn or rejected without review).

We were able to accept 81 papers as talks and an additional 35 submissions as posters.

Talks

  • An Exploration of Document Impact on Graph-Based Multi-Document Summarization Xiaojun Wan
  • Complexity of finding the BLEU-optimal hypothesis in a confusion network Gregor Leusch, Evgeny Matusov and Hermann Ney
  • Acquiring Domain-Specific Dialog Information from Task-Oriented Human-Human Interaction through an Unsupervised Learning Ananlada Chotimongkol and Alexander Rudnicky
  • Dependency-based Semantic Role Labeling of PropBank Richard Johansson and Pierre Nugues
  • Question Classification using Head Words and their Hypernyms Zhiheng Huang, Marcus Thint and Zengchang Qin
  • Sentence Fusion via Dependency Graph Compression Katja Filippova and Michael Strube
  • Revisiting Readability: A Unified Framework for Predicting Text Quality Emily Pitler and Ani Nenkova
  • Ranking Reader Emotions Using Pairwise Loss Minimization and Emotional Distribution Regression Kevin Lin and Hsin-Hsi Chen
  • It's a Contradiction—no, it's not: A Case Study using Functional Relations Alan Ritter, Stephen Soderland, Doug Downey and Oren Etzioni
  • A graph-theoretic model of lexical syntactic acquisition Hinrich Schuetze and Michael Walsh
  • Construction of an Idiom Corpus and its Application to Idiom Identification based on WSD incorporating Idiom-Specific Features Chikara Hashimoto and Daisuke Kawahara
  • Part-of-Speech Tagging for English-Spanish Code-Switched Text Thamar Solorio and Yang Liu
  • N-gram Weighting: Reducing Training Data Mismatch in Cross-Domain Language Model Estimation Bo-June (Paul) Hsu and James Glass
  • Mention Detection Crossing the Language Barrier Imed Zitouni and Radu Florian
  • Integrating Multi-level Linguistic Knowledge with a Unified Framework for Mandarin Speech Recognition Xinhao Wang, Jiazhong Nie, Dingsheng Luo and Xihong Wu
  • Relative Rank Statistics for Dialog Analysis Juan Huerta
  • Attacking Decipherment Problems Optimally with Low-Order N-gram Models Sujith Ravi and Kevin Knight
  • Incorporating Temporal and Semantic Information with Eye Gaze for Automatic Word Acquisition in Multimodal Conversational Systems, Shaolin Qu and Joyce Chai
  • Online Methods for Multi-Domain Learning and Adaptation, Mark Dredze and Koby Crammer
  • A Generative Model for Parsing Natural Language to Meaning Representations Wei Lu, Hwee Tou Ng, Wee Sun Lee and Luke. S. Zettlemoyer
  • Discriminative Learning of Selectional Preference from Unlabeled Text Shane Bergsma, Dekang Lin and Randy Goebel
  • Indirect-HMM-based Hypothesis Alignment for Combining Outputs from Machine Translation Systems Xiaodong He, Mei Yang, Jianfeng Gao, Patrick Nguyen and Robert Moore
  • One-Class Clustering in the Text Domain Ron Bekkerman and Koby Crammer
  • Information Retrieval Oriented Word Segmentation based on Character Association Strength Ranking Liu Yixuan, Wang bin, Ding fan and Xu sheng
  • Adding Redundant Features for CRFs-based Sentence Sentiment Classification Jun Zhao, Kang Liu and Gen Wang
  • Probabilistic Inference for Machine Translation with Millions of Sparse Features and a Language Model Phil Blunsom and Miles Osborne
  • Mining and Modeling Relations between Formal and Informal Chinese Phrases from Web Corpora Zhifei Li and David Yarowsky
  • Specialized models and ranking for coreference resolution Pascal Denis and Jason Baldridge
  • Phrase Translation Probabilities with ITG Priors and Smoothing as Learning Objective Markos Mylonakis and Khalil Sima'an
  • Learning to Predict Code-Switching Points Thamar Solorio and Yang Liu
  • Refining Generative Language Models using Discriminative Learning Ben Sandbank
  • A Structured Vector Space Model for Word Meaning in Context Katrin Erk and Sebastian Padó
  • Revealing the Structure of Medical Dictations with Conditional Random Fields Jeremy Jancsary, Johannes Matiasek and Harald Trost
  • Sparse Multi-Scale Grammars for Discriminative Latent Variable Parsing Slav Petrov and Dan Klein
  • Summarizing Spoken and Written Conversations Gabriel Murray and Giuseppe Carenini
  • Learning with Probabilistic Features for Improved Pipeline Models Razvan Bunescu
  • Automatic inference of the temporal location of situations in Chinese text Nianwen Xue
  • An Analysis of Active Learning Strategies for Sequence Labeling Tasks Burr Settles and Mark Craven
  • Cross-Task Knowledge-Constrained Self Training Hal Daume III
  • Decomposability of Translation Metrics for Improved Evaluation and Efficient Algorithms David Chiang, Steve DeNeefe, Yee Seng Chan and Hwee Tou Ng
  • Stacking Dependency Parsers André Filipe Torres Martins, Dipanjan Das, Noah A. Smith and Eric P. Xing
  • Learning the scope of negation in biomedical texts Roser Morante, Anthony Liekens and Walter Daelemans
  • Joint Unsupervised Coreference Resolution with Markov Logic Hoifung Poon and Pedro Domingos
  • Multilingual Subjectivity Analysis Using Machine Translation Carmen Banea, Rada Mihalcea, Janyce Wiebe and Samer Hassan
  • Predicting Success in Machine Translation Alexandra Birch, Miles Osborne and Philipp Koehn
  • Regular Expression Learning for Information Extraction Yunyao Li, Rajasekar Krishnamurthy, Sriram Raghavan, Shivakumar Vaithyanathan and H. V. Jagadish
  • Syntactic Models for Structural Word Insertion and Deletion during Translation Arul Menezes and Chris Quirk
  • A noisy-channel model of rational human sentence comprehension under uncertain input Roger Levy
  • Better Binarization for the CKY Parsing Xinying Song, Shilin Ding and Chin-Yew Lin
  • Computing Word-Pair Antonymy Saif Mohammad, Bonnie Dorr and Graeme Hirst
  • Online Large-Margin Training of Syntactic and Structural Translation Features David Chiang, Yuval Marton and Philip Resnik
  • The Linguistic Structure of English Web-Search Queries Cory Barr, Rosie Jones and Moira Regelson
  • Maximum Entropy based Rule Selection Model for Syntax-based Statistical Machine Translation Zhongjun He, Qun Liu and Shouxun Lin
  • Coarse-to-Fine Syntactic Machine Translation using Language Projections Slav Petrov, Aria Haghighi and Dan Klein
  • Jointly Combining Implicit Constraints Improves Temporal Ordering Nathanael Chambers and Dan Jurafsky
  • Automatic Prediction of Parser Accuracy Sujith Ravi, Kevin Knight and Radu Soricut
  • Word Sense Disambiguation Using OntoNotes: An Empirical Study Zhi Zhong, Hwee Tou Ng and Yee Seng Chan
  • Forest-based Translation Rule Extraction Haitao Mi and Liang Huang
  • Learning with Compositional Semantics as Structural Inference for Subsentential Sentiment Analysis Yejin Choi and Claire Cardie
  • Topic-Driven Multi-Document Summarization with Encyclopedic Knowledge and Activation Spreading Vivi Nastase
  • Scaling Textual Inference to the Web Stefan Schoenmackers, Oren Etzioni and Daniel Weld
  • A Phrase-Based Alignment Model for Natural Language Inference Bill MacCartney, Michel Galley and Christopher D. Manning
  • Lattice Minimum Bayes-Risk Decoding for Statistical Machine Translation Roy Tromble, Shankar Kumar, Franz Och and Wolfgang Macherey,
  • Lattice-based Minimum Error Rate Training Wolfgang Macherey, Franz Och, Ignacio Thayer and Jakob Uszkoreit
  • Language and Translation Model Adaptation using Comparable Corpora Matthew Snover, Bonnie Dorr and Richard Schwartz
  • Graph-based Analysis of Semantic Drift in Espresso-like Bootstrapping Algorithms Mamoru Komachi, Taku Kudo, Masashi Shimbo and Yuji Matsumoto
  • Automatic Set Expansion for List Question Answering Richard C. Wang, Nico Schlaefer, William W. Cohen and Eric Nyberg
  • CoCQA: Co-Training Over Questions and Answers for Predicting Question Subjectivity Orientation Baoli Li, Yandong Liu, and Eugene Agichtein
  • Seeded Discovery of Base Relations in Large Corpora Nicholas Andrews and Naren Ramakrishnan
  • Cheap and Fast — But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks Rion Snow, Brendan O'Connor, Dan Jurafsky and Andrew Ng
  • Weakly-Supervised Acquisition of Labeled Class Instances for Open-Domain Information Extraction using Graph Random Walks Partha Pratim Talukdar, Joseph Reisinger, Marius Pasca, Deepak Ravichandran, Rahul Bhagat and Fernando Pereira
  • Syntactic Constraints on Paraphrases Extracted from Parallel Corpora Chris Callison-Burch
  • Unsupervised Multilingual Learning for POS Tagging Benjamin Snyder, Tahira Naseem, Jacob Eisenstein and Regina Barzilay
  • Learning Graph Walk Based Similarity Measures Einat Minkov and William Cohen
  • Unsupervised Models for Coreference Resolution Vincent Ng
  • A Simple and Effective Hierarchical Phrase Reordering Model Michel Galley and Christopher Manning
  • Dependency Parsing by Belief Propagation David Smith and Jason Eisner
  • Latent-Variable Modeling of String Transductions With Finite-State Methods Markus Dreyer, Jason Smith and Jason Eisner
  • Soft-Supervised Learning for Text Classification Amarnag Subramanya and Jeff Bilmes
  • Two Languages are Better than One (for Syntactic Parsing) David Burkett and Dan Klein
  • Modeling Annotators: A Generative Approach to Learning from Annotator Rationales Omar Zaidan, Jason Eisner and Christine Piatko

Posters

  • A Tale of Two Parsers: investigating and combining graph-based and transition-based dependency parsing using beam-search Yue Zhang and Stephen Clark
  • Selecting Sentences for Answering Complex Questions Yllias Chali and Shafiq Joty
  • Using Bilingual Knowledge and Ensemble Techniques for Unsupervised Chinese Sentiment Analysis Xiaojun Wan
  • Scalable Language Processing Algorithms for the Masses: A Case Study in Computing Word Co-occurrence Matrices with MapReduce Jimmy Lin
  • LTAG Dependency Parsing with Bidirectional Incremental Construction Libin Shen and Aravind Joshi
  • Online Acquisition of Japanese Unknown Morphemes using Morphological Constraints Yugo Murawaki and Sadao Kurohashi
  • Legal Docket Classification: Where Machine Learning stumbles Ramesh Nallapati and Christopher Manning
  • A Casual Conversation System Using Modality and Word Associations Retrieved from the Web Shinsuke Higuchi, Rafal Rzepka and Kenji Araki
  • HTM: A Topic Model for Hypertexts Congkai Sun, Bin Gao and Hang Li
  • Improving Chinese Semantic Role Classification with Hierarchical Feature Selection Strategy Weiwei Ding and Baobao Chang
  • Improving Interactive Machine Translation via Mouse Actions Germén Sanchis-Trilles, Daniel Ortiz-Martìez, Jorge Civera, Francisco Casacuberta, Enrique Vidal and Hieu Hoang
  • When Harry Met Harri: Cross-lingual Name Spelling Normalization Fei Huang, Ahmad Emami and Imed Zitouni
  • HotSpots: Visualizing Edits to a Text Srinivas Bangalore and David Smith
  • Triplet Lexicon Models for Statistical Machine Translation Saša Hasan, Juri Ganitkevitch, Hermann Ney and Jesus Andres Ferrer
  • Who is Who and What is What: Experiments in Cross-Document Co-Reference Alex Baron and Marjorie Freedman
  • Improved Sentence Alignment on Parallel Web Pages Using a Stochastic Tree Alignment Model Lei Shi and Ming Zhou
  • A comparison of Bayesian estimators for unsupervised Hidden Markov Model POS taggers Jianfeng Gao and Mark Johnson
  • Arabic Named Entity Recognition using Optimal Feature Sets Yassine Benajiba, Mona Diab and Paolo Rosso
  • Multilingual Subjectivity Analysis of Multiparty Conversation Stephan Raaijmakers, Khiet Truong and Theresa Wilson
  • A Discriminative Candidate Generator for String Transformations Naoaki Okazaki, Yoshimasa Tsuruoka, Sophia Ananiadou and Jun'ichi Tsujii
  • Using Topic Models to Study the History of Ideas David Hall, Daniel Jurafsky and Christopher Manning
  • Sampling Alignment Structure under a Bayesian Translation Model John DeNero, Alexandre Bouchard-Côté and Dan Klein
  • Adapting a Lexicalized-Grammar Parser to Contrasting Domains Laura Rimell and Stephen Clark
  • Generalizing Local and Non-Local Word-Reordering Patterns for Syntax-Based Machine Translation Bing Zhao and Yaser Al-onaizan
  • Bridging Lexical Gaps between Queries and Questions on Large Online Q&A Collections with Compact Translation Models Jung-Tae Lee, Sang-Bum Kim, Young-In Song and Hae-Chang Rim
  • Automatic induction of FrameNet lexical units Marco Pennacchiotti, Diego De Cao, Roberto Basili and Danilo Croce
  • Understanding the Value of Features for Coreference Resolution Eric Bengtson and Dan Roth
  • Bayesian Unsupervised Topic Segmentation Jacob Eisenstein and Regina Barzilay
  • Transliteration as Constrained Optimization Dan Goldwasser and Dan Roth
  • Seed and Grow: Augmenting Statistically Generated Summary Sentences using Schematic Word Patterns Stephen Wan, Robart Dale, Mark Dras and Cecile Paris
  • Online Word Games for Semantic Data Collection David Vickrey, Aaron Bronsan, William Choi, Aman Kumar, Jason Turner-Maier, Arthur Wang and Daphne Koller
  • A Dependency-based Word Subsequence Kernel Rohit Kate
  • A Japanese Predicate Argument Structure Analysis using Decision Lists Hirotoshi Taira, Sanae Fujita and Masaaki Nagata