Best Papers

ACL’23 implemented the new award policy, which aims for broader recognition of exceptional research, in particular by significantly increasing the pool of outstanding papers to 1.5-2.5% of the total submissions. So, this year we have a total of 3 best papers, 4 special awards papers (Resource Award, Social Impact Award, Reproduction Award, Theme Paper Award)—and 39 outstanding papers! Additionally, there are Area Chair Awards: the Senior Area Chairs of each track had the opportunity to nominate one of their papers for a separate award. Many thanks to our Best Paper Committee for helping us with the selection process!

This page lists all the awards and honorable mentions, as well as demo track and SRW awards. But we congratulate everybody who was considered for the award: only 1.6% papers were even nominated by the reviewers. Next year, let’s all be more generous with nominations!

Best Paper Awards

  • Do Androids Laugh at Electric Sheep? Humor “Understanding” Benchmarks from The New Yorker Caption Contest
    Jack Hessel, Ana Marasovic, Jena D. Hwang, Lillian Lee, Jeff Da, Rowan Zellers, Robert Mankoff and Yejin Choi

  • What the DAAM: Interpreting Stable Diffusion Using Cross Attention
    Raphael Tang, Linqing Liu, Akshat Pandey, Zhiying Jiang, Gefei Yang, Karun Kumar, Pontus Stenetorp, Jimmy Lin and Ferhan Ture

  • From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Models
    Shangbin Feng, Chan Young Park, Yuhan Liu and Yulia Tsvetkov

Special Awards

Reproduction Award:

Do CoNLL-2003 Named Entity Taggers Still Work Well in 2023?
Shuheng Liu and Alan Ritter

Resource Award:

When Does Translation Require Context? A Data-driven, Multilingual Exploration
Patrick Fernandes, Kayo Yin, Emmy Liu, André Martins and Graham Neubig

Social Impact Award:

Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models
Myra Cheng, Esin Durmus and Dan Jurafsky

Theme Paper Award:

Weaker Than You Think: A Critical Look at Weakly Supervised Learning
Dawei Zhu, Xiaoyu Shen, Marius Mosbach, Andreas Stephan and Dietrich Klakow

Outstanding Papers

  • Backpack Language Models
    John Hewitt, John Thickstun, Christopher Manning and Percy Liang
  • CAME: Confidence-guided Adaptive Memory Efficient Optimization
    Yang Luo, Xiaozhe REN, Zangwei Zheng, ZHUO JIANG, Xin Jiang and Yang You
  • Causes and Cures for Interference in Multilingual Translation
    Uri Shaham, Maha Elbayad, Vedanuj Goswami, Omer Levy and Shruti Bhosale
  • Cognitive Reframing of Negative Thoughts through Human-Language Model Interaction
    Ashish Sharma, Kevin Rushton, Inna Lin, David Wadden, Khendra Lucas, Adam Miner, Theresa Nguyen and Tim Althoff
  • Compositional Generalization without Trees using Multiset Tagging and Latent Permutations
    Matthias Lindemann, Alexander Koller and Ivan Titov
  • Considerations for meaningful sign language machine translation based on glosses
    Mathias Müller, Zifan Jiang, Amit Moryossef, Annette Rios and Sarah Ebling
  • Dense-ATOMIC: Towards Densely-connected ATOMIC with High Knowledge Coverage and Massive Multi-hop Paths
    Xiangqing Shen, Siwei Wu and Rui Xia
  • Dissecting Transformer Length Extrapolation via the Lens of Receptive Field Analysis
    Ta-Chung Chi, Ting-Han Fan, alexander rudnicky and Peter Ramadge
  • Distilling Script Knowledge from Large Language Models for Constrained Language Planning
    Siyu Yuan, Jiangjie Chen, Ziquan Fu, Xuyang Ge, Soham Shah, Charles Jankowski, Yanghua Xiao and Deqing Yang
  • Do PLMs Know and Understand Ontological Knowledge?
    Weiqi Wu, Chengyue Jiang, Yong Jiang, Pengjun Xie and Kewei Tu
  • Don’t Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments
    Yu Gu, Xiang Deng and Yu Su
  • Extrinsic Evaluation of Machine Translation Metrics
    Nikita Moghe, Tom Sherborne, Mark Steedman and Alexandra Birch
  • Faithful Low-Resource Data-to-Text Generation through Cycle Training
    Zhuoer Wang, Marcus Collins, Nikhita Vedula, Simone Filice, Shervin Malmasi and Oleg Rokhlenko
  • Generalizing Backpropagation for Gradient-Based Interpretability
    Kevin Du, Lucas Torroba Hennigen, Niklas Stoehr, Alex Warstadt and Ryan Cotterell
  • Hexatagging: Projective Dependency Parsing as Tagging
    Afra Amini, Tianyu Liu and Ryan Cotterell
  • Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks
    Yun Tang, Anna Sun, Hirofumi Inaguma, Xinyue Chen, Ning Dong, Xutai Ma, Paden Tomasello and Juan Pino
  • Improving Pretraining Techniques for Code-Switched NLP
    Richeek Das, Sahasra Ranjan, Shreya Pathak and Preethi Jyothi
  • Knowledge Transfer in Incremental Learning for Multilingual Neural Machine Translation
    Kaiyu Huang, Peng Li, Jin Ma, Ting Yao and Yang Liu
  • Language model acceptability judgements are not always robust to context
    Koustuv Sinha, Jon Gauthier, Aaron Mueller, Kanishka Misra, Keren Fuentes, Roger Levy and Adina Williams
  • Linear Classifier: An Often-Forgotten Baseline for Text Classification
    Yu-Chen Lin, Si-An Chen, Jie-Jyun Liu and Chih-Jen Lin
  • Minding Language Models’ (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker
    Melanie Sclar, Sachin Kumar, Peter West, Alane Suhr, Yejin Choi and Yulia Tsvetkov
  • MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning
    Zhiyang Xu, Ying Shen and Lifu Huang
  • Multilingual LLMs are Better Cross-lingual In-context Learners with Alignment
    Eshaan Tanwar, Subhabrata Dutta, Manish Borthakur and Tanmoy Chakraborty
  • Neural Machine Translation Methods for Translating Text to Sign Language Glosses
    Dele Zhu, Vera Czehmann and Eleftherios Avramidis
  • NLPositionality: Characterizing Design Biases of Datasets and Models
    Sebastin Santy, Jenny Liang, Ronan Le Bras, Katharina Reinecke and Maarten Sap
  • PeaCoK: Persona Commonsense Knowledge for Consistent and Engaging Narratives
    Silin Gao, Beatriz Borges, Soyoung Oh, Deniz Bayazit, Saya Kanno, Hiromi Wakaki, Yuki Mitsufuji and Antoine Bosselut
  • QUEST: A Retrieval Dataset of Entity-Seeking Queries with Implicit Set Operations
    Chaitanya Malaviya, Peter Shaw, Ming-Wei Chang, Kenton Lee and Kristina Toutanova
  • Question-Answering in a Low-resourced Language: Benchmark Dataset and Models for Tigrinya
    Fitsum Gaim, Wonsuk Yang, Hancheol Park and Jong Park
  • Scaling in Cognitive Modelling: a Multilingual Approach to Human Reading Times
    Andrea Gregor de Varda and Marco Marelli
  • SCOTT: Self-Consistent Chain-of-Thought Distillation
    Peifeng Wang, Zhengyang Wang, Zheng Li, Yifan Gao, Bing Yin and Xiang Ren
  • The Mechanical Bard: An Interpretable Machine Learning Approach to Shakespearean Sonnet Generation
    Edwin Agnew, Michelle Qiu, Lily Zhu, Sam Wiseman and Cynthia Rudin
  • The Tail Wagging the Dog: Dataset Construction Biases of Social Bias Benchmarks
    Nikil Selvam, Sunipa Dev, Daniel Khashabi, Tushar Khot and Kai-Wei Chang
  • Towards Zero-Shot Multilingual Transfer for Code-Switched Responses
    Ting-Wei Wu, Changsheng Zhao, Ernie Chang, Yangyang Shi, Pierce Chuang, Vikas Chandra and Biing Juang
  • Transfer and Active Learning for Dissonance Detection: Addressing the Rare-Class Challenge
    Vasudha Varadarajan, Swanie Juhng, Syeda Mahwish, Xiaoran Liu, Jonah Luby, Christian Luhmann and H. Andrew Schwartz
  • VisText: A Benchmark for Semantically Rich Chart Captioning
    Benny Tang, Angie Boggust and Arvind Satyanarayan
  • What’s the Meaning of Superhuman Performance in Today’s NLU?
    Simone Tedeschi, Johan Bos, Thierry Declerck, Jan Hajič, Daniel Hershcovich, Eduard Hovy, Alexander Koller, Simon Krek, Steven Schockaert, Rico Sennrich, Ekaterina Shutova and Roberto Navigli
  • WikiBio: a Semantic Resource for the Intersectional Analysis of Biographical Events
    Marco Antonio Stranisci, Rossana Damiano, Enrico Mensa, Viviana Patti, Daniele Radicioni and Tommaso Caselli
  • World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models
    Ziqiao Ma, Jiayi Pan and Joyce Chai

Area Chair Awards

Linguistic Diversity:

Small Data, Big Impact: Leveraging Minimal Data for Effective Machine Translation
Jean Maillard, Cynthia Gao, Elahe Kalbassi, Kaushik Ram Sadagopan, Vedanuj Goswami, Philipp Koehn, Angela Fan and Francisco Guzman

Sentiment Analysis, Stylistic Analysis, and Argument Mining:

StoryTrans: Non-Parallel Story Author-Style Transfer with Discourse Representations and Content Enhancing
Xuekai Zhu, Jian Guan, Minlie Huang and Juan Liu

Discourse and Pragmatics:

Resolving Indirect Referring Expressions for Entity Selection
Mohammad Javad Hosseini, Filip Radlinski, Silvia Pareti and Annie Louis

Semantics: Sentence-level Semantics, Textual Inference, and Other Areas:

ParaAMR: A Large-Scale Syntactically Diverse Paraphrase Dataset by AMR Back-Translation
Kuan-Hao Huang, Varun Iyer, I-Hung Hsu, Anoop Kumar, Kai-Wei Chang and Aram Galstyan

Question Answering:

DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering
Ella Neeman, Roee Aharoni, Or Honovich, Leshem Choshen, Idan Szpektor and Omri Abend

Semantics: Lexical:

LexSym: Compositionality as Lexical Symmetry
Ekin Akyurek and Jacob Andreas

NLP Applications:

Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor Watermark
Wenjun Peng, Jingwei Yi, Fangzhao Wu, Shangxi Wu, Bin Bin Zhu, Lingjuan Lyu, Binxing Jiao, Tong Xu, Guangzhong Sun and Xing Xie

Speech and Multimodality:

Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition
Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiu-Shi Zhu and Eng Siong Chng

Interpretability and Analysis of Models for NLP:

Entity Tracking in Language Models
Najoung Kim and Sebastian Schuster

Linguistic Theories, Cognitive Modeling, and Psycholinguistics:

Exploring How Generative Adversarial Networks Learn Phonological Representations
Jingyi Chen and Micha Elsner

Resources and Evaluation:

Tell2Design: A Dataset for Language-Guided Floor Plan Generation
Sicong Leng, Yang Zhou, Mohammed Haroon Dupty, Wee Sun Lee, Sam Joyce and Wei Lu

Multilingualism and Cross-Lingual NLP:

Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages
Ayyoob ImaniGooghari, Peiqin Lin, Amir Hossein Kargaran, Silvia Severini, Masoud Jalili Sabet, Nora Kassner, Chunlan Ma, Helmut Schmid, André Martins, François Yvon and Hinrich Schütze

Demo Track Awards

  • Best Paper Award:
    VisKoP: Visual Knowledge oriented Programming for Interactive Knowledge Base Question Answering
    Zijun Yao, YUANYONG CHEN, Xin Lv, Shulin Cao, Amy Xin, Jifan Yu, Hailong Jin, jianjun xu, Peng Zhang, Lei Hou and Juanzi Li
  • Outstanding demo paper:
    CB2: Collaborative Natural Language Interaction Research Platform
    Jacob Sharf, Mustafa Omer Gul and Yoav Artzi
  • Outstanding demo paper:
    disco: a toolkit for Distributional Control of Generative Models
    Germán Kruszewski, Jos Rozen and Marc Dymetman

Student Research Workshop Awards

  • Assessing Chain-of-Thought Reasoning against Lexical Negation: A Case Study on Syllogism
    Mengyu Ye, Tatsuki Kuribayashi, Jun Suzuki, Hiroaki Funayama, Goro Kobayashi
  • Is a Knowledge-based Response Engaging?: An Analysis on Knowledge-Grounded Dialogue with Information Source Annotation
    Takashi Kodama, Hirokazu Kiyomaru, Yin Jou Huang, Taro Okahisa, Sadao Kurohashi
  • LECO: Improving Early Exiting via Learned Exits and Comparison-based Exiting Mechanism
    Jingfan Zhang, Ming Tan, Pengyu Dai, Wei Zhu
  • How-to Guides for Specific Audiences: A Corpus and Initial Findings
    Nicola Fanton, Agnieszka Falenska, Michael Roth

Honorable Mentions

  • ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models
    Jonas Belouadi and Steffen Eger
  • DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models
    Zijie J. Wang, Evan Montoya, David Munechika, Haoyang Yang, Benjamin Hoover and Duen Horng Chau
  • DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains
    Yanis Labrak, Adrien Bazoge, Richard Dufour, Mickael Rouvier, Emmanuel Morin, Béatrice Daille and Pierre-Antoine Gourraud
  • Entity Tracking in Language Models
    Najoung Kim and Sebastian Schuster
  • Forgotten Knowledge: Examining the Citational Amnesia in NLP
    Janvijay Singh, Mukund Rungta, Diyi Yang and Saif Mohammad
  • From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language Understanding
    Li Sun, Florian Luisier, Kayhan Batmanghelich, Dinei Florencio and Cha Zhang
  • GIFT: Graph-Induced Fine-Tuning for Multi-Party Conversation Understanding
    Jia-Chen Gu, Zhenhua Ling, Quan Liu, Cong Liu and Guoping Hu
  • Human Inspired Progressive Alignment and Comparative Learning for Grounded Word Acquisition
    Yuwei Bao, Barrett Lattimer and Joyce Chai
  • Latent Positional Information is in the Self-Attention Variance of Transformer Language Models Without Positional Embeddings
    Ta-Chung Chi, Ting-Han Fan, Li-Wei Chen, alexander rudnicky and Peter Ramadge
  • Revisiting non-English Text Simplification: A Unified Multilingual Benchmark
    Michael Ryan, Tarek Naous and Wei Xu
  • Synthetic Text Generation with Differential Privacy: A Simple and Practical Recipe
    Xiang Yue, Huseyin Inan, Xuechen Li, Girish Kumar, Julia McAnallen, Hoda Shajari, Huan Sun, David Levitan and Robert Sim
  • Theory-Grounded Computational Text Analysis
    Arya D. McCarthy and Giovanna Maria Dora Dore
  • Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters
    Boshi Wang, Sewon Min, Xiang Deng, Jiaming Shen, You Wu, Luke Zettlemoyer and Huan Sun
  • UniCoRN: Unified Cognitive Signal ReconstructioN bridging cognitive signals and human language
    Nuwa Xi, Sendong Zhao, Haochun Wang, Chi Liu, Bing Qin and Ting Liu