Industry Track
“Knowledge is Power”: Constructing Knowledge Graph of Abdominal Organs and Using Them for Automatic Radiology Report Generation
Kaveri Kale, Pushpak Bhattacharyya, Aditya Shetty, Milind Gune, Kush Shrivastava, Rustom Lawyer and Spriha Biswas
“Let’s not Quote out of Context”: Unified Vision-Language Pretraining for Context Assisted Image Captioning
Abisek Rajakumar Kalarani, Pushpak Bhattacharyya, Niyati Chhaya and Sumit Shekhar
A Static Evaluation of Code Completion by Large Language Models
Hantian Ding, Varun Kumar, Yuchen Tian, Zijian Wang, Rob Kwiatkowski, Xiaopeng Li, Murali Krishna Ramanathan, Baishakhi Ray, Parminder Bhatia, Sudipta Sengupta, Dan Roth and Bing Xiang
Accurate Training of Web-based Question Answering Systems with Feedback from Ranked Users
Liang Wang, Ivano Lauriola and Alessandro Moschitti
AI Coach Assist: An Automated Approach for Call Recommendation in Contact Centers for Agent Coaching
Md Tahmid Rahman Laskar, Cheng Chen, Xue-Yong Fu, Mahsa Azizi, Shashi Bhushan and Simon Corston-Oliver
An efficient method for Natural Language Querying on Structured Data
Hanoz Bhathena, Aviral Joshi and Prateek Singh
Annotating Research Infrastructure in Scientific Papers: An NLP-driven Approach
Seyed Amin Tabatabaei, Georgios Cheirmpos, Marius Doornenbal, Alberto Zigoni, Veronique Moore and Georgios Tsatsaronis
Answering Unanswered Questions through Semantic Reformulations in Spoken QA
Pedro Faustini, Zhiyu Chen, Besnik Fetahu, Oleg Rokhlenko and Shervin Malmasi
Application-Agnostic Language Modeling for On-Device ASR
Markus Nussbaum-Thom, Lyan Verwimp and Youssef Oualil
Automated Digitization of Unstructured Medical Prescriptions
Megha Sharma, Tushar Vatsal, Srujana Merugu and Aruna Rajan
AVEN-GR: Attribute Value Extraction and Normalization using product GRaphs
Thomas Ricatte and Donato Crisostomi
BADGE: Speeding Up BERT Inference after Deployment via Block-wise Bypasses and Divergence-based Early Exiting
Wei Zhu, Peng Wang, Yuan Ni, Guotong Xie and Xiaoling Wang
Boosting Transformers and Language Models for Clinical Prediction in Immunotherapy
Zekai Chen, Mariann Micsinai Balan and Kevin Brown
Building Accurate Low Latency ASR for Streaming Voice Search in E-commerce
Abhinav Goyal and Nikesh Garera
Chemical Language Understanding Benchmark
Yunsoo Kim, Hyuk Ko, Jane Lee, Hyun Young Heo, Jinyoung Yang, Sungsoo Lee and Kyu-hwang Lee
CocaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval
Jiapeng Wang, Chengyu Wang, Xiaodan Wang, Jun Huang and Lianwen Jin
Consistent Text Categorization using Data Augmentation in e-Commerce
Noa Avigdor, Guy Horowitz, Ariel Raviv and Stav Yanovsky Daye
Constrained Policy Optimization for Controlled Self-Learning in Conversational AI Systems
Mohammad Kachuee and Sungjin Lee
Content Moderation for Evolving Policies using Binary Question Answering
Sankha Subhra Mullick, Mohan Premchand Bhambhani, Suhit Sinha, Akshat Mathur, Somya Gupta and Jidnya Shah
Context-Aware Query Rewriting for Improving Users’ Search Experience on E-commerce Websites
Simiao Zuo, Qingyu Yin, Haoming Jiang, Shaohui Xi, Bing Yin, Chao Zhang and Tuo Zhao
CUPID: Curriculum Learning Based Real-Time Prediction using Distillation
Arindam Bhattacharya, Ankith MS, Ankit Gandhi, Vijay Huddar, Atul Saroop and Rahul Bhagat
CWSeg: An Efficient and General Approach to Chinese Word Segmentation
Dedong Li, Rui Zhao and Fei Tan
DISCOSQA: A Knowledge Base Question Answering System for Space Debris based on Program Induction
Paul Darm, Antonio Valerio Miceli Barone, Shay B. Cohen and Annalisa Riccardi
Distilled Language Models are economically efficient for the enterprise. …mostly.
Kristen Howell, Gwen Christian, Pavel Fomitchov, Gitit Kehat, Julianne Marzulla, Leanne Rolston, Jadin Tredup, Ilana Zimmerman, Ethan Selfridge and Joseph Bradley
Domain-Agnostic Neural Architecture for Class Incremental Continual Learning in Document Processing Platform
Mateusz Andrzej Wójcik, Witold Kościukiewicz, Mateusz Baran, Tomasz Kajdanowicz and Adam Fryderyk Gonczarek
Domain-specific transformer models for query translation
Mandar Kulkarni, Nikesh Garera and Anusua Trivedi
Entity Contrastive Learning in a Large-Scale Virtual Assistant System
Jonathan Rubin, Jason Crowley, George Leung, Morteza Ziyadi and Maria Minakova
Evaluating Embedding APIs for Information Retrieval
Ehsan Kamalloo, Xinyu Zhang, Odunayo Ogundepo, Nandan Thakur, David Alfonso-Hermelo, Mehdi Rezagholizadeh and Jimmy Lin
Event-Centric Query Expansion in Web Search
Yanan Zhang, Weijie Cui, Yangfan Zhang, Xiaoling Bai, Zhe Zhang, Jin Ma, Xiang Chen and Tianhua Zhou
EvolveMT: an Ensemble MT Engine Improving Itself with Usage Only
Kamer Ali Yüksel, Ahmet Gunduz, Mohamed Al-Badrashiny and Hassan Sawaf
Exploring Zero and Few-shot Techniques for Intent Classification
Soham Parikh, Mitul Tiwari, Prashil Tumbade and Quaizar Vohra
Extracting Text Representations for Terms and Phrases in Technical Domains
Francesco Fusco and Diego Antognini
FashionKLIP: Enhancing E-Commerce Image-Text Retrieval with Fashion Multi-Modal Conceptual Knowledge Graph
Xiaodan Wang, Chengyu Wang, Lei Li, Zhixu Li, Ben Chen, Linbo Jin, Jun Huang, Yanghua Xiao and Ming Gao
Federated Learning of Gboard Language Models with Differential Privacy
Zheng Xu, Yanxiang Zhang, Galen Andrew, Christopher Choquette, Peter Kairouz, Brendan McMahan, Jesse Rosenstock and Yuanbo Zhang
Generate-then-Retrieve: Intent-Aware FAQ Retrieval in Product Search
Zhiyu Chen, Jason Choi, Besnik Fetahu, Oleg Rokhlenko and Shervin Malmasi
GKD: A General Knowledge Distillation Framework for Large-scale Pre-trained Language Model
Shicheng Tan, Weng Lam Tam, Yuanchun Wang, Wenwen Gong, Shu Zhao, Peng Zhang and Jie Tang
Hunt for Buried Treasures: Extracting Unclaimed Embodiments from Patent Specifications
Chikara Hashimoto, Gautam Kumar, Shuichiro Hashimoto and Jun Suzuki
HyperT5: Towards Compute-Efficient Korean Language Modeling
Dongju Park, Soonwon Ka, Kang Min Yoo, Gichang Lee and Jaewook Kang
Improving Knowledge Production Efficiency With Question Answering on Conversation
Changlin Yang, Siye Liu, Sen Hu, Wangshu Zhang, Teng Xu and Jing Zheng
K-pop and fake facts: from texts to smart alerting for maritime security
Maxime Prieur, Souhir Gahbiche, Guillaume Gadek, Sylvain Gatepaille, Kilian Vasnier and Valerian Justine
KAFA: Rethinking Image Ad Understanding with Knowledge-Augmented Feature Adaptation of Vision-Language Models
Zhiwei Jia, Pradyumna Narayana, Arjun Akula, Garima Pruthi, Hao Su, Sugato Basu and Varun Jampani
KG-FLIP: Knowledge-guided Fashion-domain Language-Image Pre-training for E-commerce
Qinjin Jia, Yang Liu, Daoping Wu, Shaoyuan Xu, Huidong Liu, Jinmiao Fu, Roland Vollgraf and Bryan Wang
KoSBI: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model Applications
Hwaran Lee, Seokhee Hong, Joonsuk Park, Takyoung Kim, Gunhee Kim and Jung-Woo Ha
Label efficient semi-supervised conversational intent classification
Mandar Kulkarni, Kyung Hyuk Kim, Nikesh Lucky Garera and Anusua Trivedi
Large Scale Generative Multimodal Attribute Extraction for E-commerce Attributes
Anant Khandelwal, Happy Mittal, Shreyas Sunil Kulkarni and Deepak Gupta
Learn over Past, Evolve for Future: Forecasting Temporal Trends for Fake News Detection
Beizhe Hu, Qiang Sheng, Juan Cao, Yongchun Zhu, Danding Wang, Zhengjia Wang and Zhiwei Jin
MathPrompter: Mathematical Reasoning using Large Language Models
Shima Imani, Liang Du and Harsh Shrivastava
Mitigating the Burden of Redundant Datasets via Batch-Wise Unique Samples and Frequency-Aware Losses
Donato Crisostomi, Andrea Caciolai, Alessandro Pedrani, Kay Rottmann, Alessandro Manzotti, Enrico Palumbo and Davide Bernardi
MobileNMT: Enabling Translation in 15MB and 30ms
Ye Lin, Xiaohui Wang, Zhexi Zhang, Mingxuan Wang, Tong Xiao and Jingbo Zhu
Multi-doc Hybrid Summarization via Salient Representation Learning
Min Xiao
NAG-NER: a Unified Non-Autoregressive Generation Framework for Various NER Tasks
Xinpeng Zhang, Ming Tan, Jingfan Zhang and Wei Zhu
PLAtE: A Large-scale Dataset for List Page Web Extraction
Aidan W. San, Yuan Zhuang, Jan Bakus, Colin Lockard, David Ciemiewicz, Sandeep S. Atluri, Kevin Small, Yangfeng Ji and Heba Elfardy
pNLP-Mixer: an Efficient all-MLP Architecture for Language
Francesco Fusco, Damian Pascual, Peter Staar and Diego Antognini
Predicting Customer Satisfaction with Soft Labels for Ordinal Classification
Etienne Manderscheid and Matthias Lee
RadLing: Towards Efficient Radiology Report Understanding
Rikhiya Ghosh, Oladimeji Farri, Sanjeev Kumar Karn, Manuela Daniela Danu, Ramya Vunikili and Larisa Micu
Rapid Diffusion: Building Domain-Specific Text-to-Image Synthesizers with Fast Inference Speed
Bingyan Liu, Weifeng Lin, Zhongjie Duan, Chengyu Wang, Wu Ziheng, Zhang Zipeng, Kui Jia, Lianwen Jin, Cen Chen and Jun Huang
Reducing cohort bias in natural language understanding systems with targeted self-training scheme
Dieu-Thu Le, Gabriela Cortes Hernandez, Bei Chen and Melanie Bradford
Referring to Screen Texts with Voice Assistants
Shruti Bhargava, Anand Dhoot, Ing-Marie Jonsson, Hoang Long Nguyen, Alkesh Patel, Hong Yu and Vincent Renkens
Regression-Free Model Updates for Spoken Language Understanding
Andrea Caciolai, Verena Weber, Tobias Falke, Alessandro Pedrani and Davide Bernardi
Reliable and Interpretable Drift Detection in Streams of Short Texts
Ella Rabinovich, Matan Vetzler, Samuel Ackerman and Ateret Anaby Tavor
SaFER: A Robust and Efficient Framework for Fine-tuning BERT-based Classifier with Noisy Labels
Zhenting Qi, Xiaoyu Tan, Chao Qu, Yinghui Xu and Yuan Qi
Scalable and Safe Remediation of Defective Actions in Self-Learning Conversational Systems
Sarthak Ahuja, Mohammad Kachuee, Fatemeh Sheikholeslami, Weiqing Liu and Jaeyoung Do
Search Query Spell Correction with Weak Supervision in E-commerce
Vishal Kakkar, Chinmay Sharma, Madhura Pande and Surender Kumar
Semantic Ambiguity Detection in Sentence Classification using Task-Specific Embeddings
Jong Myoung Kim, Young-Jun Lee, Sangkeun Jung and Ho-Jin Choi
Sharing Encoder Representations across Languages, Domains and Tasks in Large-Scale Spoken Language Understanding
Jonathan Hueser, Judith Gaspers, Thomas Gueudre, Chandana Satya Prakash, Jin Cao, Daniil Sorokin, Quynh Do, Nicolas Anastassacos, Tobias Falke, Turan Gojayev, Mariusz Momotko, Denis Romasanta Rodriguez, Austin Doolittle, Kartik Balasubramaniam, Wael Hamza, Fabian Triefenbach and Patrick Lehnen
SPM: A Split-Parsing Method for Joint Multi-Intent Detection and Slot Filling
Sheng Jiang, Su Zhu, Ruisheng Cao, Qingliang Miao and Kai Yu
Tab-Cleaner: Weakly Supervised Tabular Data Cleaning via Pre-training for E-commerce Catalog
Kewei Cheng, Xian Li, Zhengyang Wang, Chenwei Zhang, Binxuan Huang, Yifan Ethan Xu, Xin Luna Dong and Yizhou Sun
Tab-CQA: A Tabular Conversational Question Answering Dataset on Financial Reports
Chuang Liu, Junzhuo Li and Deyi Xiong
Toward More Accurate and Generalizable Evaluation Metrics for Task-Oriented Dialogs
Abishek Komma, Nagesh Panyam Chandrasekarasastry, Timothy Leffel, Anuj Goyal, Angeliki Metallinou, Spyros Matsoukas and Aram Galstyan
Towards Building a Robust Toxicity Predictor
Dmitriy Bespalov, Sourav Bhabesh, Yi Xiang, Liutong Zhou and Yanjun Qi
Transferable and Efficient: Unifying Dynamic Multi-Domain Product Categorization
Shansan Gong, Zelin Zhou, Shuo Wang, Fengjiao Chen, Xiujie Song, Xuezhi Cao, Yunsen Xian and Kenny Zhu
Unified Contextual Query Rewriting
Yingxue Zhou, Jie Hao, Mukund Rungta, Yang Liu, Eunah Cho, Xing Fan, Yanbin Lu, Vishal Vasudevan, Kellen Gillespie, Zeynab Raeesy, Sawyer Shen, Edward Guo and Gokhan Tur
Weakly supervised hierarchical multi-task classification of customer questions
Jitenkumar Babubhai Rana, Promod Yenigalla, Chetan Aggarwal, Sandeep Sricharan Mukku, Manan Soni and Rashmi Patange
Weighted Contrastive Learning With False Negative Control to Help Long-tailed Product Classification
Tianqi Wang, Lei Chen, Xiaodan Zhu, Younghun Lee and Jing Gao
What, When, and How to Ground: Designing User Persona-Aware Conversational Agents for Engaging Dialogue
Deuksin Kwon, Sunwoo Lee, Ki Hyun Kim, Seojin Lee, Taeyoon Kim and Eric Davis
xPQA: Cross-Lingual Product Question Answering in 12 Languages
Xiaoyu Shen, Akari Asai, Bill Byrne and Adria de Gispert