Carnegie Mellon University

November 11, 2020

LTI Researchers Featured Prominently at EMNLP

37 Main Conference and 19 Findings Papers Accepted from LTI-Affiliated Authors

By Bryan Burtner

An impressive 56 papers by LTI faculty and students were accepted at the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), including 37 main conference papers and 19 papers accepted to the newly added Findings publication.

Organized by the Association for Computational Linguistics and now in its 25th year, EMNLP is one of the premier conferences worldwide in the fields of Natural Language Processing and Computational Linguistics. This year's conference was held virtually November 16-20.

Papers published by LTI researchers are as follows:

Main Conference Papers

A Bilingual Generative Transformer for Semantic Sentence Embedding. John Wieting, Graham Neubig and Taylor Berg-Kirkpatrick.

A Dataset for Tracking Entities in Open Domain Procedural Text. Niket Tandon, Keisuke Sakaguchi, Bhavana Dalvi, Dheeraj Rajagopal, Peter Clark, Michal Guerquin, Kyle Richardson and Eduard Hovy.

An Empirical Investigation of Contextualized Number Prediction. Taylor Berg-Kirkpatrick and Daniel Spokoyny.

Automatic Extraction of Rules Governing Morphological Agreement. Aditi Chaudhary, Antonios Anastasopoulos, Adithya Pratapa, David R. Mortensen, Zaid Sheikh, Yulia Tsvetkov and Graham Neubig.

CMU-MOSEAS: A Multimodal Language Dataset for Spanish, Portuguese, German and French. AmirAli Bagher Zadeh, Yansheng Cao, Simon Hessner, Paul Pu Liang, Soujanya Poria and Louis-Philippe Morency.

Constrained Fact Verification for FEVER. Adithya Pratapa, Sai Muralidhar Jayanthi and Kavya Nerella.

Detecting Attackable Sentences in Arguments. Yohan Jo, Seojin Bang, Emaad Manzoor, Eduard Hovy and Chris Reed.

Dynamic Data Selection and Weighting for Iterative Back-Translation. Zi-Yi Dou, Antonios Anastasopoulos and Graham Neubig.

Efficient Meta Lifelong-Learning with Limited Memory. Zirui Wang, Sanket Vaibhav Mehta, Barnabas Poczos and Jaime Carbonell.

Experience Grounds Language. Yonatan Bisk, Ari Holtzman, Jesse Thomason, Jacob Andreas, Yoshua Bengio, Joyce Chai, Mirella Lapata, Angeliki Lazaridou, Jonathan May, Aleksandr Nisnevich, Nicolas Pinto and Joseph Turian.

Extracting Implicitly Asserted Propositions in Argumentation. Yohan Jo, Jacky Visser, Chris Reed and Eduard Hovy.

Fortifying Toxic Speech Detectors Against Veiled Toxicity. Xiaochuang Han and Yulia Tsvetkov.

INSPIRED: Toward Sociable Recommendation Dialog Systems. Shirley Anugrah Hayati, Dongyeop Kang, Qingxiaoyang Zhu, Weiyan Shi and Zhou Yu.

Incorporating Multimodal Information in Open-Domain Web Keyphrase Extraction. Yansen Wang, Zhen Fan and Carolyn Rose.

Incorporating a Local Translation Mechanism into Non-autoregressive Translation. Xiang Kong, Zhisong Zhang and Eduard Hovy.

Interpretable Multi-dataset Evaluation for Named Entity Recognition. Jinlan Fu, Pengfei Liu and Graham Neubig.

Keeping Up Appearances: Computational Modeling of Face Acts in Persuasion Oriented Discussions. Ritam Dutt, Rishabh Joshi and Carolyn Rose.

Learning to Explain: Datasets and Models for Identifying Valid Reasoning Chains in Multihop Question-Answering. Harsh Jhamtani and Peter Clark.

Like hiking? You probably enjoy nature: Persona-grounded Dialog with Commonsense Expansions. Bodhisattwa Prasad Majumder, Harsh Jhamtani, Taylor Berg-Kirkpatrick and Julian McAuley.

MedFilter: Improving Extraction of Task-relevant Utterances through Integration of Discourse Structure and Ontological Knowledge. Sopan Khosla, Shikhar Vashishth, Jill Fain Lehman and Carolyn Rose.

Modularized Transfomer-based Ranking Framework. Luyu Gao, Zhuyun Dai and Jamie Callan.

Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis. Yao-Hung Hubert Tsai, Martin Ma, Muqiao Yang, Ruslan Salakhutdinov and Louis-Philippe Morency.

OCR Post Correction for Endangered Language Texts. Shruti Rijhwani, Antonios Anastasopoulos and Graham Neubig.

On Negative Interference in Multilingual Models: Findings and A Meta-Learning Treatment. Zirui Wang, Zachary C. Lipton and Yulia Tsvetkov.

On the Sentence Embeddings from Pre-trained Language Models. Bohan Li, Hao Zhou, Junxian He, Mingxuan Wang, Yiming Yang and Lei Li.

Plan ahead: Self-Supervised Text Planning for Paragraph Completion Task. Dongyeop Kang and Eduard Hovy.

Pre-tokenization of Multi-word Expressions in Cross-lingual Word Embeddings. Naoki Otani, Satoru Ozaki, Xingyuan Zhao, Yucen Li, Micaelah St Johns and Lori Levin.

Re-evaluating Evaluation in Text Summarization. Manik Bhandari, Pranav Narayan Gour, Atabak Ashfaq, Pengfei Liu and Graham Neubig.

Reading Between the Lines: Exploring Infilling in Visual Narratives. Khyathi Raghavi Chandu, Ruo-Ping Dong and Alan W Black.

Reformulating Unsupervised Style Transfer as Paraphrase Generation. Kalpesh Krishna, John Wieting and Mohit Iyyer.

RethinkCWS: Is Chinese Word Segmentation a Solved Task? Jinlan Fu, Pengfei Liu, Qi Zhang and Xuanjing Huang.

Social Media Attributions in the Context of Water Crisis. Rupak Sarkar, Sayantan Mahinder, Hirak Sarkar and Ashiqur KhudaBukhsh.

Summarizing Text on Any Aspects: A Knowledge-Informed Weakly-Supervised Approach. Bowen Tan, Lianhui Qin, Eric Xing and Zhiting Hu.

ToTTo: A Controlled Table-To-Text Generation Dataset. Ankur Parikh, Xuezhi Wang, Sebastian Gehrmann, Manaal Faruqui, Bhuwan Dhingra, Diyi Yang and Dipanjan Das.

Towards Medical Machine Reading Comprehension with Structural Knowledge and Plain Text. Dongfang Li, Baotian Hu, Qingcai Chen, Weihua Peng and Anqi Wang.

Unsupervised Discovery of Implicit Gender Bias. Anjalie Field and Yulia Tsvetkov.

X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models. Zhengbao Jiang, Antonios Anastasopoulos, Jun Araki, Haibo Ding and Graham Neubig.

Findings Papers

Adapting Open Domain Fact Extraction and Verification to COVID-FACT through In-Domain Language Modeling. Zhenghao Liu, Chenyan Xiong, Zhuyun Dai, Si Sun, Maosong Sun and Zhiyuan Liu.

AirConcierge: Generating Task-Oriented Dialogue via Efficient Large-Scale Knowledge Retrieval. Chieh-Yang Chen, Pei-Hsin Wang, Shih-Chieh Chang, Da-Cheng Juan, Wei Wei and Jia-Yu Pan.

An Empirical Exploration of Local Ordering Pre-training for Structured Prediction. Zhisong Zhang, Xiang Kong, Lori Levin and Eduard Hovy.

Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing. Xi Victoria Lin, Richard Socher and Caiming Xiong.

CDEvalSumm: An Empirical Study of Cross-Dataset Evaluation for Neural Summarization Systems. Yiran Chen, Pengfei Liu, Ming Zhong, Zi-Yi Dou, Danqing Wang, Xipeng Qiu and Xuanjing Huang.

Data-to-Text Generation with Style Imitation. Shuai Lin, Wentao Wang, Zichao Yang, Xiaodan Liang, Frank F. Xu, Eric Xing and Zhiting Hu.

Event-Related Bias Removal for Real-time Disaster Events. Salvador Medina Maza, Evangelia Spiliopoulou, Eduard Hovy and Alexander Hauptmann.

Fine-Grained Grounding for Multimodal Speech Recognition. Tejas Srinivasan, Ramon Sanabria, Florian Metze and Desmond Elliott.

Improving Target-side Lexical Transfer in Multilingual Neural Machine Translation. Luyu Gao, Xinyi Wang and Graham Neubig.

It’s not a Non-Issue: Negation as a Source of Error in Machine Translation. Md Mosharaf Hossain, Antonios Anastasopoulos, Eduardo Blanco and Alexis Palmer.

Making Information Seeking Easier: An Improved Pipeline for Conversational Search. Vaibhav Kumar and Jamie Callan.

Narrative Text Generation with a Latent Discrete Plan. Harsh Jhamtani and Taylor Berg-Kirkpatrick.

No Gestures Left Behind: Learning Relationships between Spoken Language and Freeform Gestures. Chaitanya Ahuja, Dong Won Lee, Ryo Ishii and Louis-Philippe Morency.

On Long-Tailed Phenomena in Neural Machine Translation. Vikas Raunak, Siddharth Dalmia, Vivek Gupta and Florian Metze.

Question Answering with Long Multiple-Span Answers. Ming Zhu, Aman Ahuja, Da-Cheng Juan, Wei Wei and Chandan K. Reddy.

RMM: A Recursive Mental Model for Dialogue Navigation. Homero Roman Roman, Yonatan Bisk, Jesse Thomason, Asli Celikyilmaz and Jianfeng Gao.

Weakly- and Semi-supervised Evidence Extraction. Danish Pruthi, Bhuwan Dhingra, Graham Neubig and Zachary C. Lipton.

What-if I ask you to explain: Explaining the effects of perturbations in procedural text. Dheeraj Rajagopal, Niket Tandon, Peter Clark, Bhavana Dalvi and Eduard Hovy.

Why and when should you pool? Analyzing Pooling in Recurrent Architectures. Pratyush Maini, Keshav Kolluru, Danish Pruthi and Mausam.