Accepted Findings Papers

Long Papers

  • Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attribute Manipulation
    Letian Peng, Yuwei Zhang, Jingbo Shang
  • Match More, Extract Better! Hybrid Matching Model for Open Domain Web Keyphrase Extraction
    Mingyang Song, Liping Jing, Yi Feng
  • End-to-End Emotion Semantic Parsing
    Xiaotong Jiang, Zhongqing Wang, Guodong Zhou
  • Overcoming Catastrophic Forgetting by Exemplar Selection in Task-oriented Dialogue System
    Chen Chen, Ruizhe Li, Yuchen Hu, Yuanyuan Chen, Chengwei Qin, Qiang Zhang
  • Unveiling Imitation Learning: Exploring the impact of Data Falsity to Large Language Model
    Hyunsoo Cho
  • The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations?
    Alex Gu, Wen-Ding Li, Naman Jain, Theo X. Olausson, Celine Lee, Koushik Sen, Armando Solar-Lezama
  • CHIME: LLM-Assisted Hierarchical Organization of Scientific Studies for Literature Review Support
    Chao-Chun Hsu, Erin Bransom, Jenna Sparks, Bailey Kuehl, Chenhao Tan, David Wadden, Lucy Lu Wang, Aakanksha Naik
  • Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation
    Hao Li, Yuping Wu, Viktor Schlegel, Riza Batista-Navarro, Tharindu Madusanka, Iqra Zahid, Jiayan Zeng, Xiaochi Wang, Xinran He, Yizhi LI, Goran Nenadic
  • Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs
    Bowen Jin, Chulin Xie, Jiawei Zhang, Kashob Kumar Roy, Yu Zhang, Zheng Li, Ruirui Li, Xianfeng Tang, Suhang Wang, Yu Meng, Jiawei Han
  • Text2DB: Integration-Aware Information Extraction with Large Language Model Agents
    Yizhu Jiao, Sha Li, Sizhe Zhou, Heng Ji, Jiawei Han
  • MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
    Vithursan Thangarasa, Mahmoud Salem, Shreyas Saxena, Chen-Yu Kevin Leong, Joel Hestness, Sean Lie
  • Lexicon-Level Contrastive Visual-Grounding Improves Language Modeling
    Chengxu Zhuang, Evelina Fedorenko, Jacob Andreas
  • P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models
    Shuo Yang, Chenchen Yuan, Yao Rong, Felix Steinbauer, Gjergji Kasneci
  • Teaching-Assistant-in-the-Loop: Improving Knowledge Distillation from Imperfect Teacher Models in Low-Budget Scenarios
    Yuhang Zhou, Wei Ai
  • Small Models are Valuable Plug-ins for Large Language Models
    Canwen Xu, Yichong Xu, Shuohang Wang, Yang Liu, Chenguang Zhu, Julian McAuley
  • Are self-explanations from Large Language Models faithful?
    Andreas Madsen, Sarath Chandar, Siva Reddy
  • ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction
    Henry Peng Zou, Vinay Samuel, Yue Zhou, Weizhi Zhang, Liancheng Fang, Zihe Song, Philip S. Yu, Cornelia Caragea
  • Prompt Engineering a Prompt Engineer
    Qinyuan Ye, Mohamed Ahmed, Reid Pryzant, Fereshte Khani
  • ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations
    Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, S Sakshi, Sanjoy Chowdhury, Dinesh Manocha
  • Tables as Texts or Images: Evaluating the Table Reasoning Ability of LLMs and MLLMs
    Naihao Deng, Zhenjie Sun, Ruiqi He, Aman Sikka, Yulong Chen, Lin Ma, Yue Zhang, Rada Mihalcea
  • Biasly: An Expert-Annotated Dataset for Subtle Misogyny Detection and Mitigation
    Brooklyn Sheppard, Anna Richter, Allison Cohen, Elizabeth Allyn Smith, Tamara Kneese, Carolyne Pelletier, Ioana Baldini, Yue Dong
  • BlendSQL: A Scalable Dialect for Unifying Hybrid Question Answering in Relational Algebra
    Parker Glenn, Parag Pravin Dakle, Liang Wang, Preethi Raghavan
  • LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
    Zechun Liu, Barlas Oguz, Changsheng Zhao, Ernie Chang, Pierre Stock, Yashar Mehdad, Yangyang Shi, Raghuraman Krishnamoorthi, Vikas Chandra
  • Towards Verifiable Generation: A Benchmark for Knowledge-aware Language Model Attribution
    Xinze Li, Yixin Cao, Liangming Pan, Yubo Ma, Aixin Sun
  • Benchmarking Cognitive Biases in Large Language Models as Evaluators
    Ryan Koo, Minhwa Lee, Vipul Raheja, Jong Inn Park, Zae Myung Kim, Dongyeop Kang
  • X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instructions
    Chong Li, Wen Yang, Jiajun Zhang, Jinliang Lu, Shaonan Wang, Chengqing Zong
  • Muffin: Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback
    Jiashuo WANG, Chunpu Xu, Chak Tou Leong, Wenjie Li, Jing Li
  • Resonance RoPE: Improving Context Length Generalization of Large Language Models
    Suyuchen Wang, Ivan Kobyzev, Peng Lu, Mehdi Rezagholizadeh, Bang Liu
  • MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning
    Xiangru Tang, Anni Zou, Zhuosheng Zhang, Ziming Li, Yilun Zhao, Xingyao Zhang, Arman Cohan, Mark Gerstein
  • Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models
    Yiming Wang, Zhuosheng Zhang, Pei Zhang, Baosong Yang, Rui Wang
  • DPDLLM: A Black-box Framework for Detecting Pre-training Data from Large Language Models
    Baohang Zhou, Zezhong WANG, Lingzhi Wang, Hongru WANG, Ying Zhang, Kehui Song, Xuhui Sui, Kam-Fai Wong
  • PACIT: Unlocking the Power of Examples for Better In-Context Instruction Tuning
    Tianci Xue, Ziqi Wang, Yixia Li, Yun Chen, Guanhua Chen
  • Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models
    Yuchen Hu, Chen Chen, Chengwei Qin, Qiushi Zhu, EngSiong Chng, Ruizhe Li
  • Towards Better Graph-based Cross-document Relation Extraction via Non-bridge Entity Enhancement and Prediction Debiasing
    Hao Yue, Shaopeng Lai, chengyiyang, Liang Zhang, Junfeng Yao, Jinsong Su
  • Large Language Models can Share Images, Too!
    Young-Jun Lee, Dokyong Lee, Joo Won Sung, Jonghwan Hyeon, Ho-Jin Choi
  • CodeM: Less Data Yields More Versatility via Ability Matrix
    Daoguang Zan, Ailun Yu, Wei Liu, Bo Shen, Shaoxin Lin, Yongshun Gong, Yafen Yao, Yan Liu, Bei Guan, Weihua Luo, Yongji Wang, Qianxiang Wang, Lizhen Cui
  • Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning
    Kung-Hsiang Huang, Mingyang Zhou, Hou Pong Chan, Yi Fung, Zhenhailong Wang, Lingyu Zhang, Shih-Fu Chang, Heng Ji
  • BIDER: Bridging Knowledge Inconsistency for Efficient Retrieval-Augmented LLMs via Key Supporting Evidence
    Jiajie Jin, Yutao Zhu, Yujia Zhou, Zhicheng Dou
  • Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions
    Wenxuan Wang, Yisi Zhang, Xingjian He, Yichen Yan, Zijia Zhao, Xinlong Wang, Jing Liu
  • Incremental Sequence Labeling: A Tale of Two Shifts
    Shengjie Qiu, Junhao Zheng, Zhen Liu, Yicheng Luo, Qianli Ma
  • How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question Answering
    Jinxin Liu, Shulin Cao, Jiaxin Shi, Tingjian Zhang, Lunyiu Nie, Linmei Hu, Lei Hou, Juanzi Li
  • MELOV: Multimodal Entity Linking with Optimized Visual Features in Latent Space
    Xuhui Sui, Ying Zhang, Yu Zhao, Kehui Song, Baohang Zhou, Xiaojie Yuan
  • Unsupervised Distractor Generation via Large Language Model Distilling and Counterfactual Contrastive Decoding
    Fanyi Qu, Hao Sun, Yunfang Wu
  • Conversational Question Answering with Language Models Generated Reformulations over Knowledge Graph
    Lihui Liu, Blaine Hill, Boxin Du, Fei Wang, Hanghang Tong
  • Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step by Step
    Li Zhong, Zilong Wang, Jingbo Shang
  • Are U a Joke Master? Pun Generation via Multi-Stage Curriculum Learning towards a Humor LLM
    Yang Chen, Chong Yang, Tu Hu, Xinhao Chen, Man Lan, Li Cai, Xinlin Zhuang, Xuan Lin, Xin Lu, Aimin Zhou
  • Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering
    Yichi Zhang, Zhuo Chen, Yin Fang, Yanxi Lu, LI FANGMING, Wen Zhang, Huajun Chen
  • MARIO: MAth Reasoning with code Interpreter Output - A Reproducible Pipeline
    Minpeng Liao, Chengxi Li, Wei Luo, Wu Jing, Kai Fan
  • DiffusPoll: Conditional Text Diffusion Model for Poll Generation
    Le Cheng, Shuangyin Li
  • Implanting LLM’s Knowledge via Reading Comprehension Tree for Toxicity Detection
    Hankun Kang, Tieyun Qian
  • LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
    Zhuoshi Pan, Qianhui Wu, Huiqiang Jiang, Menglin Xia, Xufang Luo, Jue Zhang, Qingwei Lin, Victor Rühle, Yuqing Yang, Chin-Yew Lin, H. Vicky Zhao, Lili Qiu, Dongmei Zhang
  • EconNLI: Evaluating Large Language Models on Economics Reasoning
    Yue Guo, Yi Yang
  • Better Late Than Never: Model-Agnostic Hallucination Post-Processing Framework Towards Clinical Text Summarization
    Songda Li, Yunqi Zhang, Chunyuan Deng, Yake Niu, Hui Zhao
  • Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers
    Haowen Pan, Yixin Cao, Xiaozhi Wang, Xun Yang, Meng Wang
  • Controllable Text Generation with Residual Memory Transformer
    Hanqing Zhang, Si Sun, Haiming Wu, Dawei Song
  • Prompt-Based Length Controlled Generation with Multiple Control Types
    RENLONG JIE, Xiaojun Meng, Lifeng Shang, Xin Jiang, Qun Liu
  • PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
    Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Xiangdi Meng, Tianyu Liu, Baobao Chang
  • Pearl: A Review-driven Persona-Knowledge Grounded Conversational Recommendation Dataset
    Minjin Kim, Minju Kim, Hana Kim, Beong-woo Kwak, SeongKu Kang, Youngjae Yu, Jinyoung Yeo, Dongha Lee
  • CoLLaVO: Crayon Large Language and Vision mOdel
    Byung-Kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro
  • Modelling Variability in Human Annotator Simulation
    Wen Wu, Wenlin Chen, Chao Zhang, Phil Woodland
  • BEnQA: A Question Answering Benchmark for Bengali and English
    Sheikh Shafayat, H M QUAMRAN HASAN, Minhajur Rahman Chowdhury Mahim, Rifki Afina Putri, James Thorne, Alice Oh
  • MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning
    Wanqing Cui, Keping Bi, Jiafeng Guo, Xueqi Cheng
  • Cutting Off the Head Ends the Conflict: A Mechanism for Interpreting and Mitigating Knowledge Conflicts in Language Models
    Zhuoran Jin, Pengfei Cao, Hongbang Yuan, Yubo Chen, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Kang Liu, Jun Zhao
  • BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task Tuning
    Qizhi Pei, Lijun Wu, Kaiyuan Gao, Xiaozhuan Liang, Yin Fang, Jinhua Zhu, Shufang Xie, Tao Qin, Rui Yan
  • SIBO: A Simple Booster for Parameter-Efficient Fine-Tuning
    Zhihao Wen, Jie Zhang, Yuan Fang
  • GeoEval: Benchmark for Evaluating LLMs and Multi-Modal Models on Geometry Problem-Solving
    Jiaxin Zhang, Zhong-Zhi Li, Ming-Liang Zhang, Fei Yin, Cheng-Lin Liu, Yashar Moshfeghi
  • Boosting Textural NER with Synthetic Image and Instructive Alignment
    Jiahao Wang, Wenjun Ke, Peng Wang, Hang Zhang, Dong Nie, Jiajun Liu, Guozheng Li, Ziyu Shang
  • Neurons in Large Language Models: Dead, N-gram, Positional
    Elena Voita, Javier Ferrando, Christoforos Nalmpantis
  • LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition
    Jinyuan Li, Han Li, Di Sun, Jiahao Wang, Wenkun Zhang, Zan Wang, Gang Pan
  • FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts
    Shubhankar Singh, Purvi Chaurasia, Yerram Varun, Pranshu Pandya, Vatsal Gupta, Vivek Gupta, Dan Roth
  • Unveiling the Achilles’ Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language Models
    Yiming Chen, Chen Zhang, Danqing Luo, Luis Fernando D’Haro, Robby T. Tan, Haizhou Li
  • Teacher-Student Training for Debiasing: General Permutation Debiasing for Large Language Models
    Adian Liusie, Yassir Fathullah, Mark Gales
  • Uncovering Limitations of Large Language Models in Information Seeking from Tables
    Chaoxu Pang, Yixuan Cao, Chunhao Yang, Ping Luo
  • An Ensemble-of-Experts Framework for Rehearsal-free Continual Relation Extraction
    Shen Zhou, Yongqi Li, Xin Miao, Tieyun Qian
  • Temporal Validity Change Prediction
    Georg Wenzel, Adam Jatowt
  • RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Models
    Saeed Najafi, Alona Fyshe
  • Modelling Commonsense Commonalities with Multi-Facet Concept Embeddings
    Hanane Kteich, Na Li, Usashi Chatterjee, Zied Bouraoui, Steven Schockaert
  • Revisiting Multimodal Transformers for Tabular Data with Text Fields
    Thomas Bonnier
  • ConTempo: A Unified Temporally Contrastive Framework for Temporal Relation Extraction
    Jingcheng Niu, Saifei Liao, Victoria Ng, Simon De Montigny, Gerald Penn
  • CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems
    Abbas Ghaddar, David Alfonso-Hermelo, Philippe Langlais, Mehdi Rezagholizadeh, Boxing Chen, Prasanna Parthasarathi
  • CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
    Zicheng Lin, Zhibin Gou, Tian Liang, Ruilin Luo, Haowei Liu, Yujiu Yang
  • DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language Models
    Taolin Zhang, Qizhou Chen, Dongyang Li, Chengyu Wang, Xiaofeng He, Longtao Huang, Hui Xue’, Jun Huang
  • Controllable Text Summarization: Unraveling Challenges, Approaches, and Prospects - A Survey
    Ashok Urlana, Pruthwik Mishra, Tathagato Roy, Rahul Mishra
  • Benchmarking Large Language Models on Communicative Medical Coaching: A Dataset and a Novel System
    Hengguan Huang, Songtao Wang, Hongfu Liu, Hao Wang, Ye Wang
  • Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation
    Ruomeng Ding, Chaoyun Zhang, Lu Wang, Yong Xu, Minghua Ma, Wei Zhang, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang
  • Data Augmentation using LLMs: Data Perspectives, Learning Paradigms and Challenges
    Bosheng Ding, Chengwei Qin, Ruochen Zhao, Tianze Luo, Xinze Li, Guizhen Chen, Wenhan Xia, Junjie Hu, Anh Tuan Luu, Shafiq Joty
  • CeeBERT: Cross-Domain Inference in Early Exit BERT
    Divya Jyoti Bajpai, Manjesh Kumar Hanawal
  • UNIWIZ: A Unified Large Language Model Orchestrated Wizard for Safe Knowledge Grounded Conversations
    Souvik Das, Rohini Srihari
  • VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models
    Haoyi Qiu, Wenbo Hu, Zi-Yi Dou, Nanyun Peng
  • Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding
    Xuxin Cheng, Zhihong Zhu, Bang Yang, Xianwei Zhuang, Hongxiang Li, Yuexian Zou
  • Towards Safer Large Language Models through Machine Unlearning
    Zheyuan Liu, Guangyao Dou, Zhaoxuan Tan, Yijun Tian, Meng Jiang
  • The Impact of Reasoning Step Length on Large Language Models
    Mingyu Jin, Qinkai Yu, Dong Shu, Haiyan Zhao, Wenyue Hua, Yanda Meng, Yongfeng Zhang, Mengnan Du
  • Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness
    Guangliang Liu, Milad Afshari, Xitong Zhang, Zhiyu Xue, Avrajit Ghosh, Bidhan Bashyal, Rongrong Wang, Kristen Johnson
  • SKGSum: Structured Knowledge-Guided Document Summarization
    Qiqi Wang, Ruofan Wang, Kaiqi Zhao, Robert Amor, Benjamin Liu, Jiamou Liu, Xianda Zheng, Zijian Huang
  • Chinese Spoken Named Entity Recognition in Real-world Scenarios: Dataset and Approaches
    Shilin Zhou, Zhenghua Li, Chen Gong, Lei Zhang, Yu Hong, Min Zhang
  • Can Large Multimodal Models Uncover Deep Semantics Behind Images?
    Yixin Yang, Zheng Li, Qingxiu Dong, Heming Xia, Zhifang Sui
  • Harvesting Events from Multiple Sources: Towards a Cross-Document Event Extraction Paradigm
    Qiang Gao, Zixiang Meng, Bobo Li, Jun Zhou, Fei Li, Chong Teng, Donghong Ji
  • A Graph per Persona: Reasoning about Subjective Natural Language Descriptions
    EunJeong Hwang, Vered Shwartz, Dan Gutfreund, Veronika Thost
  • MolTC: Towards Molecular Relational Modeling In Language Models
    Junfeng Fang, Shuai Zhang, Chang Wu, Zhengyi Yang, Zhiyuan Liu, Sihang Li, Kun Wang, Wenjie Du, Xiang Wang
  • KPEval: Towards Fine-Grained Semantic-Based Keyphrase Evaluation
    Di Wu, Da Yin, Kai-Wei Chang
  • Learning Low-dimensional Multi-domain Knowledge Graph Embedding via Dual Archimedean Spirals
    Jiang Li, Xiangdong Su, Fujun Zhang, Guanglai Gao
  • LoRA Meets Dropout under a Unified Framework
    Sheng Wang, Liheng Chen, Jiyue Jiang, Boyang XUE, Lingpeng Kong, Chuan Wu
  • Enhancing Text-to-SQL Parsing through Question Rewriting and Execution-Guided Refinement
    Wenxin Mao, Ruiqi Wang, Jiyu Guo, Jichuan Zeng, Cuiyun Gao, Peiyi Han, Chuanyi Liu
  • The Knowledge Alignment Problem: Bridging Human and External Knowledge for Large Language Models
    Shuo Zhang, Liangming Pan, Junzhou Zhao, William Yang Wang
  • ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models
    Haoran Luo, Haihong E, Zichen Tang, Shiyao Peng, Yikai Guo, Wentai Zhang, Chenghao Ma, Guanting Dong, Meina Song, Wei Lin, Yifan Zhu, Anh Tuan Luu
  • Achilles-Bench: A Challenging Benchmark for Low-Resource Evaluation
    Yudong Wang, Chang Ma, Qingxiu Dong, Zhifang Sui, Lingpeng Kong, Jingjing Xu
  • INTERVENOR: Prompting the Coding Ability of Large Language Models with the Interactive Chain of Repair
    Hanbin Wang, Zhenghao Liu, Shuo Wang, Ganqu Cui, Ning Ding, Zhiyuan Liu, Ge Yu
  • Context-Aware Tracking and Dynamic Introduction for Incomplete Utterance Rewriting in Extended Multi-Turn Dialogues
    Xinnan Guo, Qian Zhu, Qiuhui Shi, Xuan Lin, Liubin Wang, DaqianLi, Yongrui Chen
  • EmotionQueen: A Benchmark for Evaluating Empathy of Large Language Models
    Yuyan Chen, Songzhou Yan, Sijia Liu, Yueze Li, Yanghua Xiao
  • Plum: Prompt Learning using Metaheuristics
    Rui Pan, Shuo Xing, Shizhe Diao, Wenhe Sun, Xiang Liu, KaShun SHUM, Jipeng Zhang, Renjie Pi, Tong Zhang
  • HOTVCOM: Generating Buzzworthy Comments for Videos
    Yuyan Chen, Songzhou Yan, Qingpei Guo, Jiyuan Jia, Zhixu Li, Yanghua Xiao
  • Do Large Language Models have Problem-Solving Capability under Incomplete Information Scenarios?
    Yuyan Chen, Yueze Li, Songzhou Yan, Sijia Liu, Jiaqing Liang, Yanghua Xiao
  • Distilling Robustness into Natural Language Inference Models with Domain-Targeted Augmentation
    Joe Stacey, Marek Rei
  • Into the Unknown: Generating Geospatial Descriptions for New Environments
    Tzuf Paz-Argaman, John Palowitch, SAYALI KULKARNI, Reut Tsarfaty, Jason Michael Baldridge
  • Unpacking Tokenization: Evaluating Text Compression and its Correlation with Model Performance
    Omer Goldman, Avi Caciularu, Matan Eyal, Kris Cao, Idan Szpektor, Reut Tsarfaty
  • Length-aware Byte Pair Encoding for Mitigating Over-segmentation in Korean Machine Translation
    Jungseob Lee, Hyeonseok Moon, Seungjun Lee, Chanjun Park, Sugyeong Eo, Hyunwoong Ko, Jaehyung Seo, Seungyoon Lee, Heuiseok Lim
  • Multilingual Instruction Tuning With Just a Pinch of Multilinguality
    Uri Shaham, Jonathan Herzig, Roee Aharoni, Idan Szpektor, Reut Tsarfaty, Matan Eyal
  • M3-Embedding: Multi-Linguality, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation
    Jianlv Chen, Shitao Xiao, Peitian Zhang, Kun Luo, Defu Lian, Zheng Liu
  • Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback
    Zhangqian Bi, Yao Wan, Zheng Wang, Hongyu Zhang, Batu Guan, Fangxin Lu, Zili Zhang, Yulei Sui, Hai Jin, Xuanhua Shi
  • An Element is Worth a Thousand Words: Enhancing Legal Case Retrieval by Incorporating Legal Elements
    Chenlong Deng, Zhicheng Dou, Yujia Zhou, Peitian Zhang, Kelong Mao
  • SoMeLVLM: A Large Vision Language Model for Social Media Processing
    Xinnong Zhang, Haoyu Kuang, Xinyi Mou, Hanjia Lyu, Kun Wu, Siming Chen, Jiebo Luo, Xuanjing Huang, zhongyu wei
  • KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
    Jaehyung Seo, Jaewook Lee, Chanjun Park, SeongTae Hong, Seungjun Lee, Heuiseok Lim
  • NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language Models
    Amit Dhurandhar, Tejaswini Pedapati, Ronny Luss, Soham Dan, Aurelie Lozano, Payel Das, Georgios Kollias
  • Ranking Large Language Models without Ground Truth
    Amit Dhurandhar, Rahul Nair, Moninder Singh, Elizabeth M. Daly, Karthikeyan Natesan Ramamurthy
  • Integrating Physician Diagnostic Logic into Large Language Models: Preference Learning from Process Feedback
    Chengfeng Dou, ying zhang, Zhi Jin, Wenpin Jiao, Haiyan Zhao, Yongqiang Zhao, Zhengwei Tao
  • LM-Cocktail: Resilient Tuning of Language Models via Model Merging
    Shitao Xiao, Zheng Liu, Peitian Zhang, Xingrun Xing
  • Episodic Memory Retrieval from LLMs: A Neuromorphic Mechanism to Generate Commonsense Counterfactuals for Relation Extraction
    Xin Miao, Yongqi Li, Shen Zhou, Tieyun Qian
  • SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 Languages
    Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Abinew Ali Ayele, pavan baswani, Meriem Beloucif, Chris Biemann, Sofia Bourhim, Christine de Kock, Genet Shanko Dekebo, Oumaima Hourrane, Gopichand Kanumolu, Lokesh Madasu, Samuel Rutunda, Manish Shrivastava, Thamar Solorio, Nirmal Surange, Hailegnaw Getaneh Tilaye, Krishnapriya Vishnubhotla, Genta Indra Winata, Seid Muhie Yimam, Saif M. Mohammad
  • Alirector: Alignment-Enhanced Chinese Grammatical Error Corrector
    Haihui Yang, Xiaojun Quan
  • The Emotion Dynamics of Literary Novels
    Krishnapriya Vishnubhotla, Adam Hammond, Graeme Hirst, Saif M. Mohammad
  • LANS: A Layout-Aware Neural Solver for Plane Geometry Problem
    Zhong-Zhi Li, Ming-Liang Zhang, Fei Yin, Cheng-Lin Liu
  • Knowledge Crosswords: Geometric Knowledge Reasoning with Large Language Models
    Wenxuan Ding, Shangbin Feng, Yuhan Liu, Zhaoxuan Tan, Vidhisha Balachandran, Tianxing He, Yulia Tsvetkov
  • DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection
    Herun Wan, Shangbin Feng, Zhaoxuan Tan, Heng Wang, Yulia Tsvetkov, Minnan Luo
  • The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts
    Lingfeng Shen, Weiting Tan, Sihao Chen, Yunmo Chen, Jingyu Zhang, Haoran Xu, Boyuan Zheng, Philipp Koehn, Daniel Khashabi
  • Self-Specialization: Uncovering Latent Expertise within Large Language Models
    Junmo Kang, Hongyin Luo, Yada Zhu, Jacob A Hansen, James R. Glass, David Daniel Cox, Alan Ritter, Rogerio Feris, Leonid Karlinsky
  • FUSE: Measure-Theoretic Compact Fuzzy Set Representation for Taxonomy Expansion
    Fred Xu, Song Jiang, Zijie Huang, Xiao Luo, Shichang Zhang, Yuanzhou Chen, Yizhou Sun
  • Chain of Logic: Rule-Based Reasoning with Large Language Models
    Sergio Servantez, Joe Barrow, Kristian J Hammond, Rajiv Jain
  • Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations
    Cheng-Han Chiang, Hung-yi Lee
  • Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment
    William Merrill, Zhaofeng Wu, Norihito Naka, Yoon Kim, Tal Linzen
  • Simulated Misinformation Susceptibility (SMISTS): Enhancing Misinformation Research with Large Language Model Simulations
    Weicheng Ma, Chunyuan Deng, Aram Moossavi, Lili Wang, Soroush Vosoughi, Diyi Yang
  • Social Intelligence Data Infrastructure: Structuring the Present and Navigating the Future
    Minzhi Li, Weiyan Shi, Caleb Ziems, Diyi Yang
  • MODABS: Multi-Objective Learning for Dynamic Aspect-Based Summarization
    Xiaobo Guo, Soroush Vosoughi
  • Non-compositional Expression Generation and its Continual Learning
    Jianing Zhou, Suma Bhat
  • Medical Dialogue System: A Survey of Categories, Methods, Evaluation and Challenges
    Xiaoming Shi, Zeming Liu, Li Du, Yuxuan Wang, Hongru WANG, Yuhang Guo, Tong Ruan, JIE XU, Xiaofan Zhang, Shaoting Zhang
  • Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs
    Thi Minh Vuong Nguyen, LINHAO LUO, Fatemeh Shiri, Dinh Phung, Yuan-Fang Li, Trang Vu, Gholamreza Haffari
  • Comprehensive Abstractive Comment Summarization with Dynamic Clustering and Chain of Thought
    Longyin Zhang, Bowei Zou, Jacintha Wee Yun Yi, AiTi Aw
  • Self-Supervised Position Debiasing for Large Language Models
    Zhongkun Liu, Zheng Chen, Mengqi Zhang, Zhaochun Ren, Pengjie Ren, Zhumin Chen
  • HyperCL: A Contrastive Learning Framework for Hyper-Relational Knowledge Graph Embedding with Hierarchical Ontology
    Yuhuan Lu, Weijian Yu, Xin Jing, Dingqi Yang
  • Encoding Hierarchical Schema via Concept Flow for Multifaceted Ideology Detection
    Songtao Liu, Bang Wang, Wei Xiang, Han Xu, Minghua Xu
  • Character-Level Chinese Dependency Parsing via Modeling Latent Intra-Word Structure
    Yang Hou, Zhenghua Li
  • AlignRE: An Encoding and Semantic Alignment Approach for Zero-Shot Relation Extraction
    Zehan Li, Fu Zhang, Jingwei Cheng
  • Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction
    Tingchen Fu, Deng Cai, Lemao Liu, Shuming Shi, Rui Yan
  • Efficient Knowledge Infusion via KG-LLM Alignment
    Zhouyu Jiang, Ling Zhong, Mengshu Sun, Jun Xu, Rui Sun, Hui Cai, SHUHAN LUO, Zhiqiang Zhang
  • Towards Precise Localization of Critical Errors in Machine Translation
    Dahyun Jung, Sugyeong Eo, Heuiseok Lim
  • LoRAPrune: Structured Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning
    Mingyang Zhang, Hao Chen, Chunhua Shen, Zhen Yang, Linlin Ou, Xinyi Yu, Bohan Zhuang
  • Speculative Decoding via Early-exiting for Faster LLM Inference with Thompson Sampling Control Mechanism
    Jiahao Liu, Qifan Wang, Jingang Wang, Xunliang Cai
  • AgentTuning: Enabling Generalized Agent Abilities for LLMs
    Aohan Zeng, Mingdao Liu, Rui Lu, Bowen Wang, Xiao Liu, Yuxiao Dong, Jie Tang
  • Transition-based Opinion Generation for Aspect-based Sentiment Analysis
    Tianlai Ma, Zhongqing Wang, Guodong Zhou
  • Modeling Dynamic Topics in Chain-Free Fashion by Evolution-Tracking Contrastive Learning and Unassociated Word Exclusion
    Xiaobao Wu, Xinshuai Dong, Liangming Pan, Thong Thanh Nguyen, Anh Tuan Luu
  • A Chinese Dataset for Evaluating the Safeguards in Large Language Models
    Yuxia Wang, Zenan Zhai, Haonan Li, Xudong Han, Shom Lin, Zhenxuan ZHANG, Angela Jingru Zhao, Preslav Nakov, Timothy Baldwin
  • LLMFactor: Extracting Profitable Factors through Prompts for Explainable Stock Movement Prediction
    Meiyun Wang, Kiyoshi Izumi, Hiroki Sakaji
  • You Only Look at Screens: Multimodal Chain-of-Action Agents
    Zhuosheng Zhang, Aston Zhang
  • $\rm SP^3$: Enhancing Structured Pruning via PCA Projection
    Yuxuan Hu, Jing Zhang, Zhe Zhao, Chen Zhao, Xiaodong Chen, Cuiping Li, Hong Chen
  • GENDEX: Generative Data Augmentation Strategy Leveraging External Data for Abstractive Dialogue Summarization
    Sangwon Park, Hongseok Choi, Dongha Choi, Hyunju Lee
  • A Tale of Two Revisions: Summarizing Changes Across Document Versions
    Santosh T.Y.S.S, Natwar Modani, Apoorv Saxena
  • Refine, Align, and Aggregate: Multi-view Linguistic Features Enhancement for Aspect Sentiment Triplet Extraction
    Guixin Su, Mingmin Wu, Zhongqiang Huang, Yongcheng Zhang, Tongguan Wang, Yuxue Hu, Ying Sha
  • The Music Maestro or The Musically Challenged, A Massive Music Evaluation Benchmark for Large Language Models
    Jiajia Li, lu Yang, Mingni Tang, Chenchong, Zuchao Li, Ping Wang, hai zhao
  • PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference
    Dongjie Yang, Xiaodong Han, Yan Gao, Yao Hu, Shilin Zhang, hai zhao
  • From Role-Play to Drama-Interaction: An LLM Solution
    Weiqi Wu, Hongqiu Wu, Lai Jiang, Xingyuan Liu, hai zhao, Min Zhang
  • TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models
    Jaewoo Ahn, Taehyun Lee, Junyoung Lim, Jin-Hwa Kim, Sangdoo Yun, Hwaran Lee, Gunhee Kim
  • Red Teaming Visual Language Models
    Mukai Li, Lei Li, Yuwei Yin, Masood Ahmed, Zhenguang Liu, Qi Liu
  • Enhancing Semantic Consistency of Large Language Models through Model Editing: An Interpretability-Oriented Approach
    JINGYUAN YANG, Dapeng Chen, Yajing Sun, Rongjun Li, Zhiyong Feng, Wei Peng
  • Semantic Skill Grounding for Embodied Instruction-Following in Cross-Domain Environments
    Sangwoo Shin, SeungHyun Kim, Youngsoo Jang, Moontae Lee, Honguk Woo
  • LIRE: listwise reward enhancement for preference alignment
    Mingye Zhu, Yi Liu, Lei Zhang, Junbo Guo, Zhendong Mao
  • See It All: Contextualized Late Aggregation for 3D Dense Captioning
    Minjung Kim, Hyung Suk Lim, Seung Hwan Kim, Soonyoung Lee, Bumsoo Kim, Gunhee Kim
  • $\texttt{DARA}$: Decomposition-Alignment-Reasoning Autonomous Language Agent for Question Answering over Knowledge Graphs
    Haishuo Fang, Xiaodan Zhu, Iryna Gurevych
  • GKT: A Novel Guidance-Based Knowledge Transfer Framework For Efficient Cloud-edge Collaboration LLM Deployment
    Yao Yao, Zuchao Li, hai zhao
  • Compositional Generalization with Grounded Language Models
    Sondre Wold, Étienne Simon, Lucas Georges Gabriel Charpentier, Egor V. Kostylev, Erik Velldal, Lilja Øvrelid
  • Rethinking Negative Instances for Generative Named Entity Recognition
    Yuyang Ding, Juntao Li, Pinzheng Wang, Zecheng Tang, Yan Bowen, Min Zhang
  • WilKE: Wise-Layer Knowledge Editor for Lifelong Knowledge Editing
    Chenhui Hu, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao
  • DINER: Debiasing Aspect-based Sentiment Analysis with Multi-variable Causal Inference
    Jialong Wu, Linhai Zhang, Deyu Zhou, Guoqiang Xu
  • STAR: Constraint LoRA with Dynamic Active Learning for Data-Efficient Fine-Tuning of Large Language Models
    Linhai Zhang, Jialong Wu, Deyu Zhou, Guoqiang Xu
  • How Much Does Nonverbal Communication Conform to Entropy Rate Constancy?: A Case Study on Listener Gaze in Interaction
    Yu Wang, Yang Xu, Gabriel Skantze, Hendrik Buschmeier
  • Lost in the Source Language: How Large Language Models Evaluate the Quality of Machine Translation
    Xu Huang, Zhirui Zhang, Xiang Geng, Yichao Du, Jiajun Chen, Shujian Huang
  • Chain-of-Verification Reduces Hallucination in Large Language Models
    Shehzaad Dhuliawala, Mojtaba Komeili, Jing Xu, Roberta Raileanu, Xian Li, Asli Celikyilmaz, Jason E Weston
  • Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method
    Tian Xia, Zhiwei He, Tong Ren, Yibo Miao, Zhuosheng Zhang, Yang Yang, Rui Wang
  • DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories
    Jia Li, Ge Li, Yunfei Zhao, Yongmin Li, Huanyu Liu, Hao Zhu, Lecheng Wang, Kaibo Liu, Zheng Fang, Lanshen Wang, jiazheng ding, Xuanming Zhang, YUQI ZHU, Yihong Dong, Zhi Jin, Binhua Li, Fei Huang, Yongbin Li, Bin Gu, Mengfei Yang
  • LPNL: Scalable Link Prediction with Large Language Models
    Baolong Bi, Shenghua Liu, Yiwei Wang, Lingrui Mei, Xueqi Cheng
  • Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives
    Thong Thanh Nguyen, Yi Bin, Junbin Xiao, Leigang Qu, Yicong Li, Jay Zhangjie Wu, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu
  • Generative Input: Towards Next-Generation Input Methods Paradigm
    Keyu Ding, Yongcan Wang, Zihang Xu, Zhenzhen Jia, Enhong Chen
  • A + B: A General Generator-Reader Framework for Optimizing LLMs to Unleash Synergy Potential
    Wei Tang, Yixin Cao, Jiahao Ying, Bo Wang, Yuyue Zhao, Yong Liao, Pengyuan Zhou
  • Functional Overlap Reranking for Neural Code Generation
    Hung Quoc To, Minh Huynh Nguyen, Nghi D. Q. Bui
  • Adversarial Preference Optimization: Enhancing Your Alignment via RM-LLM Game
    Pengyu Cheng, Yifan Yang, Jian Li, Yong Dai, Tianhao Hu, peixin cao, nan du, Xiaolong Li
  • Pinpointing Diffusion Grid Noise to Enhance Aspect Sentiment Quad Prediction
    Linan ZHU, Xiangfan Chen, Xiaolei Guo, Chenwei Zhang, Zhechao Zhu, Zehai Zhou, Xiangjie Kong
  • Continual Contrastive Spoken Language Understanding
    Umberto Cappellazzo, Enrico Fini, Muqiao Yang, Daniele Falavigna, Alessio Brutti, Bhiksha Raj
  • LLM as Prompter: Low-resource Inductive Reasoning on Arbitrary Knowledge Graphs
    Kai Wang, YUWEI XU, Zhiyong Wu, Siqiang Luo
  • Unsupervised Parsing by Searching for Frequent Word Sequences among Sentences with Equivalent Predicate-Argument Structures
    Junjie Chen, Xiangheng He, Danushka Bollegala, Yusuke Miyao
  • Data-Centric Explainable Debiasing for Improving Fairness in Pre-trained Language Models
    Yingji Li, Mengnan Du, Rui Song, Xin Wang, Ying Wang
  • Knowledge-Driven Cross-Document Relation Extraction
    Monika Jain, Raghava Mutharaju, Kuldeep Singh, Ramakanth Kavuluru
  • Injecting Salesperson’s Dialogue Strategies in Large Language Models with Chain-of-Thought Reasoning
    Wen Yu Chang, Yun-Nung Chen
  • KG-Adapter: Enabling Knowledge Graph Integration in Large Language Models through Parameter-Efficient Fine-Tuning
    Shiyu Tian, Yangyang Luo, Tianze Xu, Caixia Yuan, Huixing Jiang, Chen Wei, Xiaojie Wang
  • Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios
    Lei Lin, Jiayi Fu, Pengli Liu, Qingyang Li, Yan Gong, Junchen Wan, Fuzheng Zhang, Zhongyuan Wang, Di ZHANG, Kun Gai
  • Evaluating LLMs’ Mathematical Reasoning in Financial Document Question Answering
    Pragya Srivastava, Manuj Malik, Vivek Gupta, Tanuja Ganu, Dan Roth
  • Can Large Language Models Mine Interpretable Financial Factors More Effectively? A Neural-Symbolic Factor Mining Agent Model
    Zhiwei Li, Ran Song, Caihong Sun, Wei Xu, Zhengtao Yu, Ji-Rong Wen
  • Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint
    Xiaowei Yuan, Zhao Yang, Yequan Wang, Shengping Liu, Jun Zhao, Kang Liu
  • SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models
    Lijun Li, Bowen Dong, Ruohui Wang, Xuhao Hu, Wangmeng Zuo, Dahua Lin, Yu Qiao, Jing Shao
  • Extracting and Encoding: Leveraging Large Language Models and Medical Knowledge to Enhance Radiological Text Representation
    Pablo Messina, Rene Vidal, Denis Parra, Alvaro Soto, Vladimir Araujo
  • GNNavi: Navigating the Information Flow in Large Language Models by Graph Neural Network
    Shuzhou Yuan, Ercong Nie, Michael Färber, Helmut Schmid, Hinrich Schuetze
  • M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question Answering
    Anand Subramanian, Viktor Schlegel, Abhinav Ramesh Kashyap, Thanh-Tung Nguyen, Vijay Prakash Dwivedi, Stefan Winkler
  • Autonomous Workflow for Multimodal Fine-Grained Training Assistants Towards Mixed Reality
    Jiahuan Pei, Irene Viola, Haochen Huang, Junxiao Wang, Moonisa Ahsan, Fanghua Ye, Jiang Yiming, Yao Sai, Di Wang, Zhumin Chen, Pengjie Ren, Pablo Cesar
  • Perceptions of Language Technology Failures from South Asian English Speakers
    Faye Holt, William Barr Held, Diyi Yang
  • A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task
    Jannik Brinkmann, Abhay Sheshadri, Victor Levoso, Paul Swoboda, Christian Bartelt
  • Optimal Transport Guided Correlation Assignment for Multimodal Entity Linking
    Zefeng Zhang, Jiawei Sheng, ZHANG CHUANG, liangyunzhi, Wenyuan Zhang, Siqi Wang, Tingwen Liu
  • On Efficiently Representing Regular Languages as RNNs
    Anej Svete, Robin Chan, Ryan Cotterell
  • A Survey on Modelling Morality for Text Analysis
    Ines Reinig, Maria Becker, Ines Rehbein, Simone Paolo Ponzetto
  • Your Vision-Language Model Itself Is a Strong Filter: Towards High-Quality Instruction Tuning with Data Selection
    Ruibo Chen, Yihan Wu, Lichang Chen, Guodong Liu, Qi He, Tianyi Xiong, Chenxi Liu, Junfeng Guo, Heng Huang
  • DebugBench: Evaluating Debugging Capability of Large Language Models
    Runchu Tian, Yining Ye, Yujia Qin, Xin Cong, Yankai Lin, Yinxu Pan, Yesai Wu, Hui Haotian, Liu Weichuan, Zhiyuan Liu, Maosong Sun
  • POP-CEE: Position-oriented Prompt-tuning Model for Causal Emotion Entailment
    Zhihan Zhou, Xue Gu, Yujie Zhao, Hao Xu
  • Wav2SQL: Direct Generalizable Speech-To-SQL Parsing
    Huadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao
  • E2-LLM: Efficient and Extreme Length Extension of Large Language Models
    Jiaheng Liu, ZhiqiBai, Yuanxing Zhang, Zhang Chenchen, YuangZh, Ge Zhang, JiakaiWang, Haoran Que, Yukang Chen, Wenbo Su, Tiezheng Ge, Jie Fu, Wenhu Chen, Bo Zheng
  • Are Female Carpenters like Blue Bananas? A Corpus Investigation of Occupation Gender Typicality
    Da JU, Karen Ullrich, Adina Williams
  • Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments
    Sitao Cheng, Ziyuan Zhuang, Yong Xu, Fangkai Yang, Chaoyun Zhang, Xiaoting Qin, Xiang Huang, Ling Chen, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan, Qi Zhang
  • Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI Interpretation in Indian Courts
    Shubham Kumar Nigam, Anurag Sharma, Danush Khanna, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya
  • RulE: Knowledge Graph Reasoning with Rule Embedding
    Xiaojuan Tang, Song-Chun Zhu, Yitao Liang, Muhan Zhang
  • Multi-Objective Linguistic Control of Large Language Models
    Dang Nguyen, Jiuhai Chen, Tianyi Zhou
  • Evaluating the Smooth Control of Attribute Intensity in Text Generation with LLMs
    Shang Zhou, Feng Yao, Chengyu Dong, Zihan Wang, Jingbo Shang
  • Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios
    Shijue Huang, Wanjun Zhong, Jianqiao Lu, Qi Zhu, Jiahui Gao, Weiwen Liu, Yutai Hou, Xingshan Zeng, Yasheng Wang, Lifeng Shang, Xin Jiang, Ruifeng Xu, Qun Liu
  • Do Androids Know They’re Only Dreaming of Electric Sheep?
    Sky CH-Wang, Benjamin Van Durme, Jason Eisner, Chris Kedzie
  • URG: A Unified Ranking and Generation Method for Ensembling Language Models
    Bo Lv, Chen Tang, Yanan Zhang, Xin Liu, Ping Luo, Yue Yu
  • Multi-Modal Retrieval For Large Language Model Based Speech Recognition
    Aditya Gourav, Jari Kolehmainen, Prashanth Gurunath Shivakumar, Yile Gu, Grant Strimel, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko
  • LoraRetriever: Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild
    Ziyu Zhao, Leilei Gan, Guoyin Wang, Wangchunshu Zhou, Hongxia Yang, Kun Kuang, Fei Wu
  • ELAD: Explanation-Guided Large Language Models Active Distillation
    Yifei Zhang, Bo Pan, Chen Ling, Yuntong Hu, Liang Zhao
  • Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ
    Carolin Holtermann, Paul Röttger, Timm Dill, Anne Lauscher
  • The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)
    Shenglai Zeng, Jiankun Zhang, Pengfei He, Yiding Liu, Yue Xing, Han Xu, Jie Ren, Yi Chang, Shuaiqiang Wang, Dawei Yin, Jiliang Tang
  • EmpathicStories++: A Multimodal Dataset for Empathy Towards Personal Experiences
    Jocelyn J Shen, Yubin Kim, Mohit Hulse, Wazeer Zulfikar, Sharifa Alghowinem, Cynthia Breazeal, Hae Won Park
  • MRL Parsing Without Tears: The Case of Hebrew
    Shaltiel Shmidman, Avi Shmidman, Moshe Koppel, Reut Tsarfaty
  • SyntaxShap: Syntax-aware Explainability Method for Text Generation
    Kenza Amara, Rita Sevastjanova, Mennatallah El-Assady
  • Enhancing Hyperbolic Knowledge Graph Embeddings via Lorentz Transformations
    Xiran Fan, Minghua Xu, Huiyuan Chen, Yuzhong Chen, Mahashweta Das, Hao Yang
  • Tell Me What’s Next: Textual Foresight for Generic UI Representations
    Andrea Burns, Kate Saenko, Bryan A. Plummer
  • Probing the Uniquely Identifiable Linguistic Patterns of Conversational AI Agents
    Iqra Zahid, Tharindu Madusanka, Riza Batista-Navarro, Youcheng Sun
  • The Butterfly Effect of Altering Prompts: How Small Changes and Jailbreaks Affect Large Language Model Performance
    Abel Salinas, Fred Morstatter
  • X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification
    Hanzi Xu, Muhao Chen, Lifu Huang, Slobodan Vucetic, Wenpeng Yin
  • SPIN: Sparsifying and Integrating Internal Neurons in Large Language Models for Text Classification
    Difan Jiao, Yilun Liu, Zhenwei Tang, Daniel Matter, Jürgen Pfeffer, Ashton Anderson
  • Decomposing Co-occurrence Matrices into Interpretable Components as Formal Concepts
    Akihiro Maeda, Takuma Torii, Shohei Hidaka
  • Two-Pronged Human Evaluation of ChatGPT Self-Correction in Radiology Report Simplification
    Ziyu Yang, Santhosh Cherian, Slobodan Vucetic
  • Planning First, Question Second: An LLM-Guided Method for Controllable Question Generation
    Kunze Li, Yu Zhang
  • RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback
    Yanming Liu, Xinyue Peng, Xuhong Zhang, Weihao Liu, Jianwei Yin, Jiannan Cao, Tianyu Du
  • MrRank: Improving Question Answering Retrieval System through Multi-Result Ranking Model
    Danupat Khamnuansin, Tawunrat Chalothorn, Ekapol Chuangsuwanich
  • Chain-of-Question: A Progressive Question Decomposition Approach for Complex Knowledge Base Question Answering
    Peng Yixing, Quan Wang, Licheng Zhang, Yi Liu, Zhendong Mao
  • Instruction Tuning with Retrieval-based Examples Ranking for Aspect-based Sentiment Analysis
    Guangmin Zheng, Jin Wang, Liang-Chih Yu, Xuejie Zhang
  • Unveiling the Truth and Facilitating Change: Towards Agent-based Large-scale Social Movement Simulation
    Xinyi Mou, zhongyu wei, Xuanjing Huang
  • Locating and Extracting Relational Concepts in Large Language Models
    Zijian Wang, Britney Whyte, Chang Xu
  • Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language Models
    Mingda Li, Xinyu Li, Yifan Chen, Wenfeng Xuan, Weinan Zhang
  • SenticVec: Toward Robust and Human-Centric Neurosymbolic Sentiment Analysis
    Xulang Zhang, Rui Mao, Erik Cambria
  • Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models
    Chen Qian, Jie Zhang, Wei Yao, Dongrui Liu, Zhenfei Yin, Yu Qiao, Yong Liu, Jing Shao
  • Language Models can Evaluate Themselves via Probability Discrepancy
    Tingyu Xia, Bowen Yu, Yuan Wu, Yi Chang, Chang Zhou
  • Evaluating the Validity of Word-level Adversarial Attacks with Large Language Models
    Huichi Zhou, Zhaoyang Wang, Hongtao Wang, Dongping Chen, Wenhan Mu, Fangyuan Zhang
  • On the Language Encoder of Contrastive Cross-modal Models
    Mengjie Zhao, Junya Ono, Zhi Zhong, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Wei-Hsiang Liao, Takashi Shibuya, Hiromi Wakaki, Yuki Mitsufuji
  • Your Co-Workers Matter: Evaluating Collaborative Capabilities of Language Models in Blocks World
    Guande Wu, Chen Zhao, Claudio Silva, He He
  • Anchor-based Large Language Models
    Jianhui Pang, Fanghua Ye, Derek F. Wong, Xin He, Wanshun CHEN, Longyue Wang
  • MLeVLM: Improve Multi-level Progressive Capabilities based on Multimodal Large Language Model for Medical Visual Question Answering
    Dexuan Xu, Yanyuan Chen, Jieyi Wang, Yue Huang, Hanpin Wang, Zhi Jin, Hongxing Wang, Weihua Yue, Jing He, Hang Li, Yu Huang
  • Disentangling Length from Quality in Direct Preference Optimization
    Ryan Park, Rafael Rafailov, Stefano Ermon, Chelsea Finn
  • MIKE: A New Benchmark for Fine-grained Multimodal Entity Knowledge Editing
    Jiaqi Li, Miaozeng Du, Chuanyi Zhang, Yongrui Chen, Nan Hu, Guilin Qi, Haiyun Jiang, Siyuan Cheng, Bozhong Tian
  • Reformulating Domain Adaptation of Large Language Models as Adapt-Retrieve-Revise: A Case Study on Chinese Legal Domain
    Zhen Wan, Yating Zhang, Yexiang Wang, Fei Cheng, Sadao Kurohashi
  • MemeMQA: Multimodal Question Answering for Memes via Rationale-Based Inferencing
    Siddhant Agarwal, Shivam Sharma, Preslav Nakov, Tanmoy Chakraborty
  • Improving Attributed Text Generation of Large Language Models via Preference Learning
    Dongfang Li, Zetian Sun, Baotian Hu, zhenyu liu, Xinshuo Hu, Xuebo Liu, Min Zhang
  • KOMBO: Korean Character Representations Based on the Combination Rules of Subcharacters
    SungHo Kim, Juhyeong Park, Yeachan Kim, SangKeun Lee
  • Tree-Planted Transformers: Unidirectional Transformer Language Models with Implicit Syntactic Supervision
    Ryo Yoshida, Taiga Someya, Yohei Oseki
  • Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit Clues
    Zhiyuan Chang, Mingyang Li, Yi Liu, Junjie Wang, Qing Wang, Yang Liu
  • Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes
    Sunjun Kweon, Junu Kim, Jiyoun Kim, Sujeong Im, Eunbyeol Cho, Seongsu Bae, Jungwoo Oh, Gyubok Lee, Jong Hak Moon, Seng Chan You, Seungjin Baek, Chang Hoon Han, YOON BIN JUNG, Yohan Jo, Edward Choi
  • Extending Context Window of Large Language Models via Semantic Compression
    Weizhi Fei, Xueyan Niu, Pingyi Zhou, Lu Hou, Bo Bai, Lei Deng, Wei Han
  • Plausible Extractive Rationalization through Semi-Supervised Entailment Signal
    Yeo Wei Jie, Ranjan Satapathy, Erik Cambria
  • Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering
    ChaeHun Park, Koanho Lee, Hyesu Lim, Jaeseok Kim, Junmo Park, Yu-Jung Heo, Du-Seong Chang, Jaegul Choo
  • Scented-EAE: Stage-Customized Entity Type Embedding for Event Argument Extraction
    Yu Yang, Jinyu Guo, Kai Shuang, Chenrui Mao
  • Fast Randomized Low-Rank Adaptation of Pre-trained Language Models with PAC Regularization
    Zijian Lei, Dong Qian, William K. Cheung
  • SDA: Semantic Discrepancy Alignment for Text-conditioned Image Retrieval
    Yuchen Yang, Yu Wang, Yanfeng Wang
  • $Se^2$: Sequential Example Selection for In-Context Learning
    Haoyu Liu, Jianfeng Liu, Shaohan Huang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Furu Wei, Qi Zhang
  • Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct Decoding
    Hanling Yi, Feng Lin, Hongbin Li, Peiyang Ning, Xiaotian Yu, Rong Xiao
  • StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation
    Boxi Cao, Mengjie Ren, Hongyu Lin, Xianpei Han, Feng Zhang, Junfeng Zhan, Le Sun
  • Mitigating Privacy Seesaw in Large Language Models: Augmented Privacy Neuron Editing via Activation Patching
    Xinwei Wu, Weilong Dong, Shaoyang Xu, Deyi Xiong
  • BadActs: A Universal Backdoor Defense in the Activation Space
    Biao Yi, Sishuo Chen, Yiming Li, Tong Li, Baolei Zhang, Zheli Liu
  • ReactXT: Understanding Molecular “Reaction-ship” via Reaction-Contextualized Molecule-Text Pretraining
    Zhiyuan Liu, Yaorui Shi, An Zhang, Sihang Li, Enzhi Zhang, Xiang Wang, Kenji Kawaguchi, Tat-Seng Chua
  • Multi-modal Concept Alignment Pre-training for Generative Medical Visual Question Answering
    Quan Yan, Junwen Duan, Jianxin Wang
  • Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques
    Siva Rajesh Kasa, Aniket Goel, Karan Gupta, Sumegh Roychowdhury, Pattisapu Nikhil Priyatam, Anish bhanushali, Prasanna Srinivasa Murthy
  • The Butterfly Effect of Model Editing: Few Edits Can Trigger Large Language Models Collapse
    Wanli Yang, Fei Sun, Xinyu Ma, Xun Liu, Dawei Yin, Xueqi Cheng
  • Can We Continually Edit Language Models? On the Knowledge Attenuation in Sequential Model Editing
    Qi Li, Xiaowen Chu
  • Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation
    Ge Qu, Jinyang Li, Bowen Li, Bowen Qin, Nan Huo, Chenhao Ma, Reynold Cheng
  • Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation
    Zhibin Lan, Liqiang Niu, Fandong Meng, Jie Zhou, Min Zhang, Jinsong Su
  • StatBot.Swiss: Bilingual Open Data Exploration in Natural Language
    Farhad Nooralahzadeh, Yi Zhang, Ellery Smith, Sabine Maennel, Cyril Matthey-Doret, Raphaël De Fondeville, Kurt Stockinger
  • Subtle Signatures, Strong Shields: Advancing Robust and Imperceptible Watermarking in Large Language Models
    Yubing Ren, Ping Guo, Yanan Cao, Wei Ma
  • Thinking about how to extract: Energizing LLMs’ emergence capabilities for document-level event argument extraction
    Kai Shuang, zhouji, wang qiwei, Jinyu Guo “* Improving the Robustness of Distantly-Supervised Named Entity Recognition via Uncertainty-Aware Teacher Learning and Student-Student Collaborative Learning
    Shuzheng Si, Helan Hu, Haozhe Zhao, Shuang Zeng, Kaikai An, Zefan Cai, Baobao Chang
  • SSS: Editing Factual Knowledge in Language Models towards Semantic Sparse Space
    Huazheng Wang, Haifeng Sun, Jingyu Wang, Qi Qi, Zixuan Xia, Menghao Zhang, Jianxin Liao
  • $\textit{GeoHard}$: Towards Measuring Class-wise Hardness through Modelling Class Semantics
    Fengyu Cai, Xinran Zhao, Hongming Zhang, Iryna Gurevych, Heinz Koeppl
  • Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models
    Sheng-Lun Wei, Cheng-Kuang Wu, Hen-Hsen Huang, Hsin-Hsi Chen
  • ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic
    Fajri Koto, Haonan Li, Sara Shatnawi, Jad Doughman, Abdelrahman Boda Sadallah, Aisha Alraeesi, Khalid Almubarak, Zaid Alyafeai, Neha Sengupta, Shady Shehata, Nizar Habash, Preslav Nakov, Timothy Baldwin
  • On the Relationship Between RNN Hidden-State Vectors and Semantic Structures
    Edi Muskardin, Martin Tappler, Ingo Pill, Bernhard K. Aichernig, Thomas Pock
  • XMC-Agent : Dynamic Navigation over Scalable Hierarchical Index for Incremental Extreme Multi-label Classification
    yanjiang liu, Tianyun Zhong, Yaojie Lu, Hongyu Lin, Ben He, Shuheng Zhou, Huijia Zhu, Weiqiang Wang, Zhongyi Liu, Xianpei Han, Le Sun
  • Benchmarking Large Language Models on CFLUE - A Chinese Financial Language Understanding Evaluation Dataset
    Jie Zhu, Junhui Li, yalong wen, Lifan Guo
  • Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint
    Zhipeng Chen, Kun Zhou, Xin Zhao, Junchen Wan, Fuzheng Zhang, Di ZHANG, Ji-Rong Wen
  • Definition generation for lexical semantic change detection
    Mariia Fedorova, Andrey Kutuzov, Yves Scherrer
  • MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector
    Marta R. Costa-jussà, Mariano Coria Meglioli, Pierre Andrews, David Dale, Prangthip Hansanti, Elahe Kalbassi, Alexandre Mourachko, Christophe Ropers, Carleigh Wood
  • Phased Instruction Fine-Tuning for Large Language Models
    Wei Pang, Chuan Zhou, Xiao-Hua Zhou, Xiaojie Wang
  • TOREE: Evaluating Topic Relevance of Student Essays for Chinese Primary and Middle School Education
    Xinlin Zhuang, Hongyi Wu, Xinshu Shen, Peimin Yu, Gaowei Yi, Xinhao Chen, Tu Hu, Yang Chen, Yupei Ren, Yadong Zhang, Youqi Song, Binxuan Liu, Man Lan
  • Predicting the Unpredictable: Uncertainty-Aware Reasoning over Temporal Knowledge Graphs via Diffusion Process
    Yuxiang Cai, Qiao Liu, Yanglei Gan, Changlin Li, Xueyi Liu, Run Lin, Da Luo, JiayeYang
  • Asymmetric Bias in Text-to-Image Generation with Adversarial Attacks
    Haz Sameen Shahgir, Xianghao Kong, Greg Ver Steeg, Yue Dong
  • Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs
    Xun Liang, Hanyu Wang, Shichao Song, Mengting Hu, Xunzhi Wang, Zhiyu li, Feiyu Xiong, Bo Tang
  • Coconut: Contextualized Commonsense Unified Transformers for Graph-Based Commonsense Augmentation of Language Models
    Jun-Hyung Park, Mingyu Lee, Junho Kim, SangKeun Lee
  • Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge
    Daniel Tamayo Mela, Aitor Gonzalez-Agirre, Javier Hernando, Marta Villegas
  • BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains
    Yanis Labrak, Adrien Bazoge, Emmanuel Morin, Pierre-Antoine GOURRAUD, Mickael Rouvier, Richard Dufour
  • All Languages Matter: On the Multilingual Safety of LLMs
    Wenxuan Wang, Zhaopeng Tu, Chang Chen, Youliang Yuan, Jen-tse Huang, Wenxiang Jiao, Michael Lyu
  • LJPCheck: Functional Tests for Legal Judgment Prediction
    Yuan Zhang, Wanhong Huang, Yi Feng, Chuanyi Li, Zhiwei Fei, Jidong Ge, Bin Luo, Vincent Ng
  • CMDL: A Large-Scale Chinese Multi-Defendant Legal Judgment Prediction Dataset
    Wanhong Huang, Yi Feng, Chuanyi Li, Honghan Wu, Jidong Ge, Vincent Ng
  • Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning
    Qiming Bao, Alex Yuxuan Peng, Zhenyun Deng, Wanjun Zhong, Gael Gendron, Timothy Pistotti, Neset TAN, Nathan Young, Yang Chen, Yonghua Zhu, Paul Denny, Michael Witbrock, Jiamou Liu
  • CodeInsight: A Curated Dataset of Practical Coding Solutions from Stack Overflow
    Nathanaël Beau, Benoit Crabbé
  • ViHateT5: Enhancing Hate Speech Detection in Vietnamese With a Unified Text-to-Text Transformer Model
    Luan Thanh Nguyen
  • Bias in News Summarization: Measures, Pitfalls and Corpora
    Julius Steen, Katja Markert
  • When to Trust LLMs: Aligning Confidence with Response Quality
    Shuchang Tao, Liuyi Yao, Hanxing Ding, Yuexiang Xie, Qi Cao, Fei Sun, Jinyang Gao, Huawei Shen, Bolin Ding
  • Zero-shot Cross-lingual Alignment for Embedding Initialization
    Xi Ai, Zhiyong Huang
  • It takes two to borrow: a donor and a recipient. Who’s who?
    Liviu P Dinu, Ana Sabina Uban, Anca Daniela Dinu, Ioan-Bogdan Iordache, Simona Georgescu, Laurentiu Zoicas
  • Advancing Post-OCR Correction: A Comparative Study of Synthetic Data
    Shuhao Guan, Derek Greene
  • GeoAgent: To Empower LLMs using Geospatial Tools for Address Standardization
    Chenghua Huang, Shisong Chen, Zhixu Li, Jianfeng Qu, Yanghua Xiao, Jiaxin Liu, Zhigang Chen
  • HQP: A Human-Annotated Dataset for Detecting Online Propaganda
    Abdurahman Maarouf, Dominik Bär, Dominique Geissler, Stefan Feuerriegel
  • Teaching Language Models to Self-Improve by Learning from Language Feedback
    Chi Hu, Yimin Hu, Hang Cao, Tong Xiao, JingBo Zhu
  • Exploring Spatial Schema Intuitions in Large Language and Vision Models
    Philipp Wicke, Lennart Wachowiak
  • Efficient Detection of LLM-generated Texts with a Bayesian Surrogate Model
    Yibo Miao, Hongcheng Gao, Hao Zhang, Zhijie Deng
  • Decoding the Narratives: Analyzing Personal Drug Experiences Shared on Reddit
    Layla Bouzoubaa, Elham Aghakhani, Max Song, Quang Minh Trinh, Shadi Rezapour
  • Unveiling the Art of Heading Design: A Harmonious Blend of Summarization, Neology, and Algorithm
    Shaobo Cui, Yiyang Feng, Yisong Mao, Yifan Hou, Boi Faltings
  • Understanding Fine-grained Distortions in Reports of Scientific Findings
    Amelie Wuehrl, Dustin Wright, Roman Klinger, Isabelle Augenstein
  • MM-SOC: Benchmarking Multimodal Large Language Models in Social Media Platforms
    Yiqiao Jin, Minje Choi, Gaurav Verma, Jindong Wang, Srijan Kumar
  • Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance
    Saurabh Srivastava, Chengyue Huang, Weiguo Fan, Ziyu Yao
  • Benchmarking Retrieval-Augmented Generation for Medicine
    Guangzhi Xiong, Qiao Jin, Zhiyong Lu, Aidong Zhang
  • ChatMusician: Understanding and Generating Music Intrinsically with LLM
    Ruibin Yuan, Hanfeng Lin, Yi Wang, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Liumeng Xue, Ziyang Ma, Qin Liu, Tianyu Zheng, Yizhi LI, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Chenghua Lin, Qifeng Liu, Tao Jiang, Wenhao Huang, Wenhu Chen, Jie Fu, Emmanouil Benetos, Gus Xia, Roger Dannenberg, Wei Xue, Shiyin Kang, Yike Guo
  • Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction Tuning
    Qingyu Tan, Hwee Tou Ng, Lidong Bing
  • Mind Your Format: Towards Consistent Evaluation of In-Context Learning Improvements
    Anton Voronov, Lena Wolf, Max Ryabinin
  • Knowledge Graph-Enhanced Large Language Models via Path Selection
    Haochen Liu, Song Wang, Yaochen Zhu, Yushun Dong, Jundong Li
  • OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection
    Chenyang Huang, Abbas Ghaddar, Ivan Kobyzev, Mehdi Rezagholizadeh, Osmar Zaiane, Boxing Chen
  • ONSEP: A Novel Online Neural-Symbolic Framework for Event Prediction Based on Large Language Model
    Xuanqing Yu, Wangtao Sun, Jingwei Li, Kang Liu, Chengbao Liu, Jie Tan
  • Speech-based Slot Filling using Large Language Models
    Guangzhi Sun, Shutong Feng, Dongcheng Jiang, Chao Zhang, Milica Gasic, Phil Woodland
  • Too Big to Fail: Larger Language Models are Disproportionately Resilient to Induction of Dementia-Related Linguistic Anomalies
    Changye Li, Zhecheng Sheng, Trevor Cohen, Serguei V. S. Pakhomov
  • TRAM: Benchmarking Temporal Reasoning for Large Language Models
    Yuqing Wang, Yun Zhao
  • Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models
    Alfonso Amayuelas, Kyle Wong, Liangming Pan, Wenhu Chen, William Yang Wang
  • Exploring Defeasibility in Causal Reasoning
    Shaobo Cui, Lazar Milikic, Yiyang Feng, Mete Ismayilzada, Debjit Paul, Antoine Bosselut, Boi Faltings
  • Better Synthetic Data by Retrieving and Transforming Existing Datasets
    Saumya Sandipkumar Gandhi, Ritu Gala, Vijay Viswanathan, Tongshuang Wu, Graham Neubig
  • Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language Models
    Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He
  • Perspective Taking through Generating Responses to Conflict Situations
    Joan Plepi, Charles Welch, Lucie Flek
  • LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
    Nicholas Lee, Thanakul Wattanawong, Sehoon Kim, Karttikeya Mangalam, Sheng Shen, Gopala Anumanchipalli, Michael W. Mahoney, Kurt Keutzer, Amir Gholami
  • The Power of Summary-Source Alignments
    Ori Ernst, Ori Shapira, Aviv Slobodkin, Sharon Adar, Mohit Bansal, Jacob Goldberger, Ran Levy, Ido Dagan
  • An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models
    Gantavya Bhatt, Yifang Chen, Arnav Mohanty Das, Jifan Zhang, Sang T. Truong, Stephen Mussmann, Yinglun Zhu, Jeff Bilmes, Simon Shaolei Du, Kevin Jamieson, Jordan T. Ash, Robert D Nowak
  • Learning Multimodal Contrast with Cross-modal Memory and Reinforced Contrast Recognition
    Yuanhe Tian, Fei Xia, Yan Song
  • Text Simplification via Adaptive Teaching
    Seyed Ali Bahrainian, Jonathan Dou, Carsten Eickhoff
  • A multi-level multi-label text classification dataset of 19th century Ottoman and Russian literary and critical texts
    Gokcen Gokceoglu, Devrim Çavuşoğlu, Emre Akbas, Özen Nergis Dolcerocca
  • Whose Emotions and Moral Sentiments do Language Models Reflect?
    Zihao He, Siyi Guo, Ashwin Rao, Kristina Lerman
  • LLM can Achieve Self-Regulation via Hyperparameter Aware Generation
    Siyin Wang, Shimin Li, Tianxiang Sun, Jinlan Fu, Qinyuan Cheng, Jiasheng Ye, Junjie Ye, Xipeng Qiu, Xuanjing Huang
  • Forward-Backward Reasoning in Large Language Models for Mathematical Verification
    Weisen Jiang, Han Shi, Longhui Yu, Zhengying Liu, Yu Zhang, Zhenguo Li, James Kwok
  • Towards Uncertainty-Aware Language Agent
    Jiuzhou Han, Wray Buntine, Ehsan Shareghi
  • Detection and Positive Reconstruction of Cognitive Distortion Sentences: Mandarin Dataset and Evaluation
    Shuya Lin, Yuxiong Wang, Jonathan Dong, Shiguang NI
  • PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMs
    Jiuzhou Han, Nigel Collier, Wray Buntine, Ehsan Shareghi
  • Two-stage Generative Question Answering on Temporal Knowledge Graph Using Large Language Models
    Yifu Gao, Linbo Qiao, Zhigang Kan, Zhihua Wen, Yongquan He, Dongsheng Li
  • VISREAS: Complex Visual Reasoning with Unanswerable Questions
    Syeda Nahida Akter, Sangwu Lee, Yingshan Chang, Yonatan Bisk, Eric Nyberg
  • A Unified Generative Framework for Bilingual Euphemism Detection and Identification
    Yuxue Hu, Junsong Li, Tongguan Wang, Dongyu Su, Guixin Su, Ying Sha
  • StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing
    Gaoxiang Cong, Yuankai Qi, Liang Li, Amin Beheshti, Zhedong Zhang, Anton van den Hengel, Ming-Hsuan Yang, Chenggang Yan, Qingming Huang
  • ETAS: Zero-Shot Transformer Architecture Search via Network Trainability and Expressivity
    Jiechao Yang, Yong Liu
  • Reasoning Like a Doctor: Improving Medical Dialogue Systems via Diagnostic Reasoning Process Alignment
    Kaishuai Xu, Yi Cheng, Wenjun Hou, Qiaoyu Tan, Wenjie Li
  • ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
    Yanan Wu, Jie Liu, Xingyuan Bu, Jiaheng Liu, Zhanhui Zhou, Yuanxing Zhang, Chenchen Zhang, ZhiqiBai, Haibin Chen, Tiezheng Ge, Wanli Ouyang, Wenbo Su, Bo Zheng
  • REInstruct: Building Instruction Data from Unlabeled Corpus
    Shu Chen, Xinyan Guan, Yaojie Lu, Hongyu Lin, Xianpei Han, Le Sun
  • Learning to Maximize Mutual Information for Chain-of-Thought Distillation
    Xin Chen, Hanxian Huang, Yanjun Gao, Yi Wang, Jishen Zhao, Ke Ding
  • PEMT: Multi-Task Correlation Guided Mixture-of-Experts Enables Parameter-Efficient Transfer Learning
    Zhisheng Lin, Han Fu, Chenghao Liu, Zhuo Li, Jianling Sun
  • MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark
    Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen
  • Identifying Semantic Induction Heads to Understand In-Context Learning
    Jie Ren, Qipeng Guo, Hang Yan, Dongrui Liu, Quanshi Zhang, Xipeng Qiu, Dahua Lin
  • Chinese Spelling Corrector Is Just a Language Learner
    Lai Jiang, Hongqiu Wu, hai zhao, Min Zhang
  • Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models
    Junfei Wu, Qiang Liu, Ding Wang, Jinghao Zhang, Shu Wu, Liang Wang, Tieniu Tan
  • LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
    Xi Chen, Songyang Zhang, Qibing Bai, Kai Chen, Satoshi Nakamura
  • Plan, Generate and Complicate: Improving Low-resource Dialogue State Tracking via Easy-to-Difficult Zero-shot Data Augmentation
    Ming Gu, Yan Yang
  • DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling
    Shanghaoran Quan
  • Comments as Natural Logic Pivots: Improve Code Generation via Comment Perspective
    Yijie Chen, Yijin Liu, Fandong Meng, Yufeng Chen, Jinan Xu, Jie Zhou
  • Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration
    Sunhao Dai, Weihao Liu, Yuqi Zhou, Liang Pang, Rongju Ruan, Gang Wang, Zhenhua Dong, Jun Xu, Ji-Rong Wen
  • Continual Dialogue State Tracking via Reason-of-Select Distillation
    Yujie Feng, Bo LIU, Xiaoyu DONG, ZEXIN LU, Li-Ming Zhan, Xiao-Ming Wu, Albert Y.S. Lam
  • Spotting AI’s Touch: Identifying LLM-Paraphrased Spans in Text
    Yafu Li, Zhilin Wang, Leyang Cui, Wei Bi, Shuming Shi, Yue Zhang
  • SoFA: Shielded On-the-fly Alignment via Priority Rule Following
    Xinyu Lu, Bowen Yu, Yaojie Lu, Hongyu Lin, Haiyang Yu, Le Sun, Xianpei Han, Yongbin Li
  • Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning
    Lukas Christ, Shahin Amiriparian, Manuel Milling, Ilhan Aslan, Björn Schuller
  • RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter
    Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li
  • Benchmarking and Improving Long-Text Translation with Large Language Models
    Longyue Wang, Zefeng Du, Wenxiang Jiao, Chenyang Lyu, Jianhui Pang, Leyang Cui, Kaiqiang Song, Derek F. Wong, Shuming Shi, Zhaopeng Tu
  • Personalized Topic Selection Model for Topic-Grounded Dialogue
    Shixuan Fan, Wei Wei, Xiaofei Wen, Xian-Ling Mao, Jixiong Chen, Dangyang Chen
  • Debiasing In-Context Learning by Instructing LLMs How to Follow Demonstrations
    Lvxue Li, Jiaqi Chen, Xinyu Lu, Yaojie Lu, Hongyu Lin, Shuheng Zhou, Huijia Zhu, Weiqiang Wang, Zhongyi Liu, Xianpei Han, Le Sun
  • Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems
    Christos Vlachos, Themos Stafylakis, Ion Androutsopoulos
  • MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Production
    Jian Ma, Wenguan Wang, Yi Yang, Feng Zheng
  • BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models
    Xueliang Zhao, Xinting Huang, Tingchen Fu, Qintong Li, Shansan Gong, Lemao Liu, Wei Bi, Lingpeng Kong
  • PartialFormer: Modeling Part Instead of Whole for Machine Translation
    Tong Zheng, Bei Li, Huiwen Bao, Jiale Wang, Weiqiao Shan, Tong Xiao, JingBo Zhu
  • PACE: Improving Prompt with Actor-Critic Editing for Large Language Model
    Yihong Dong, Kangcheng Luo, Xue Jiang, Zhi Jin, Ge Li
  • Penetrative AI: Making LLMs Comprehend the Physical World
    Huatao Xu, Liying Han, Qirui Yang, Mo Li, Mani Srivastava
  • The Impact of Demonstrations on Multilingual In-Context Learning: A Multidimensional Analysis
    Miaoran Zhang, Vagrant Gautam, Mingyang Wang, Jesujoba Oluwadara Alabi, Xiaoyu Shen, Dietrich Klakow, Marius Mosbach
  • Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell Checking
    Ming Dong, Yujing Chen, Zhang Miao, Hao Sun, Tingting He
  • An Empirical Study of In-context Learning in LLMs for Machine Translation
    Pranjal A Chitale, Jay Gala, Raj Dabre
  • ODA: Observation-Driven Agent for integrating LLMs and Knowledge Graphs
    Lei Sun, Zhengwei Tao, Youdi Li, Hiroshi Arakawa
  • A Comprehensive Study of Jailbreak Attack versus Defense for Large Language Models
    Zihao Xu, Yi Liu, Gelei Deng, Yuekang Li, Stjepan Picek
  • A Data-Driven Guided Decoding Mechanism for Diagnostic Captioning
    Panagiotis Kaliosis, John Pavlopoulos, Foivos Charalampakos, Georgios Moschovis, Ion Androutsopoulos
  • Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model
    Hengyuan Zhang, Yanru Wu, Dawei Li, Ziqing Yang, Rui Zhao, Yong Jiang, Fei Tan
  • A Two-Agent Game for Zero-shot Relation Triplet Extraction
    Ting Xu, Haiqin Yang, Fei Zhao, Zhen Wu, Xinyu Dai
  • Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
    Naibin Gu, Peng Fu, Xiyu Liu, Bowen Shen, Zheng Lin, Weiping Wang
  • Trust in Internal or External Knowledge? Generative Multi-Modal Entity Linking with Knowledge Retriever
    Xinwei Long, Jiali Zeng, Fandong Meng, Jie Zhou, Bowen Zhou
  • A Semantic Distance Metric Learning approach for Lexical Semantic Change Detection
    Taichi Aida, Danushka Bollegala
  • What Have We Achieved on Non-autoregressive Translation?
    Yafu Li, Huajian Zhang, Jianhao Yan, Yongjing Yin, Yue Zhang
  • Large Language Models Fall Short: Understanding Complex Relationships in Detective Narratives
    Runcong Zhao, Qinglin Zhu, Hainiu Xu, Jiazheng Li, Yuxiang Zhou, Yulan He, Lin Gui
  • DistillMIKE: Editing Distillation of Massive In-Context Knowledge Editing in Large Language Models
    Shanbao Qiao, Xuebing Liu, Seung-Hoon Na
  • Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding
    Heming Xia, Zhe Yang, Qingxiu Dong, Peiyi Wang, Yongqi Li, Tao Ge, Tianyu Liu, Wenjie Li, Zhifang Sui
  • Hierarchy-aware Biased Bound Margin Loss Function for Hierarchical Text Classification
    Gibaeg Kim, SangHun Im, Heung-Seon Oh
  • Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts
    Zhuo Chen, Xinyu Wang, Yong Jiang, Pengjun Xie, Fei Huang, Kewei Tu
  • CICLe: Conformal In-Context Learning for Largescale Multi-Class Food Risk Classification
    Korbinian Randl, John Pavlopoulos, Aron Henriksson, Tony Lindgren
  • IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact
    Ruikang Liu, Haoli Bai, Haokun Lin, Yuening Li, Han Gao, Zhengzhuo Xu, Lu Hou, Jun Yao, Chun Yuan
  • Learning Adverbs with Spectral Mixture Kernels
    Tomoe Taniguchi, Daichi Mochihashi, Ichiro Kobayashi
  • E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models
    Jinchang Hou, Chang Ao, Haihong Wu, Xiangtao Kong, Zhigang Zheng, Daijia Tang, Chengming Li, Xiping Hu, Ruifeng Xu, Shiwen Ni, Min Yang
  • ChartAssistant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning
    Fanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo
  • Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering
    Xiang Li, Shizhu He, Fangyu Lei, JunYang, Tianhuang Su, Kang Liu, Jun Zhao
  • ALaRM: Align Language Models via Hierarchical Rewards Modeling
    Yuhang Lai, Siyuan Wang, Shujun Liu, Xuanjing Huang, zhongyu wei
  • Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language Models
    Zhenyi Lu, Jie Tian, Wei Wei, Xiaoye Qu, Yu Cheng, Wenfeng xie, Dangyang Chen
  • UOR: Universal Backdoor Attacks on Pre-trained Language Models
    Wei Du, Peixuan Li, Haodong Zhao, Tianjie Ju, Ge Ren, Gongshen Liu
  • Language models emulate certain cognitive profiles: An investigation of how predictability measures interact with individual differences
    Patrick Haller, Lena Sophia Bolliger, Lena Ann Jäger
  • NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Queries
    Shudan Zhang, Hanlin Zhao, Xiao Liu, Qinkai Zheng, Zehan Qi, Xiaotao Gu, Yuxiao Dong, Jie Tang
  • LLMCrit: Teaching Large Language Models to Use Criteria
    Weizhe Yuan, Pengfei Liu, Matthias Gallé
  • Empowering cross-lingual abilities of instruction-tuned large language models by translation-following demonstrations
    Leonardo Ranaldi, Giulia Pucci, Andre Freitas
  • Ranking Entities along Conceptual Space Dimensions with LLMs: An Analysis of Fine-Tuning Strategies
    Nitesh Kumar, Usashi Chatterjee, Steven Schockaert
  • Efficient $k$-Nearest-Neighbor Machine Translation with Dynamic Retrieval
    Yan Gao, Zhiwei Cao, Zhongjian Miao, Baosong Yang, Shiyu Liu, Min Zhang, Jinsong Su
  • Symmetric Dot-Product Attention for Efficient Training of BERT Language Models
    Martin Courtois, Malte Ostendorff, Leonhard Hennig, Georg Rehm
  • Synthesizing Conversations from Unlabeled Documents using Automatic Response Segmentation
    Fanyou Wu, Weijie Xu, Chandan K. Reddy, Srinivasan H. Sengamedu
  • Can Large Language Models Follow Concept Annotation Guidelines? A Case Study on Scientific and Financial Domains
    Marcio Fonseca, Shay B Cohen
  • Alignment-Based Decoding Policy for Low-Latency and Anticipation-Free Neural Japanese Input Method Editors
    Armin Sarhangzadeh, Taro Watanabe
  • ECoK: Emotional Commonsense Knowledge Graph for Mining Emotional Gold
    Zhunheng Wang, Xiaoyi Liu, Mengting Hu, Rui Ying, Ming Jiang, Jianfeng Wu, Yalan Xie, Hang Gao, Renhong Cheng
  • Deterministic Reversible Data Augmentation for Neural Machine Translation
    Jiashu Yao, Heyan Huang, Zeming Liu, Yuhang Guo
  • Latent Learningscape Guided In-context Learning
    Anlai Zhou, Sunshine Jiang, Yifei Liu, Yiquan Wu, Kun Kuang, Jun Xiao
  • SMR: State Memory Replay for Long Sequence Modeling
    Biqing Qi, Junqi Gao, Kaiyan Zhang, Dong Li, Jianxing Liu, Ligang Wu, Bowen Zhou
  • Characterizing Large Language Models as Rationalizers of Knowledge-intensive Tasks
    Aditi Mishra, Sajjadur Rahman, Kushan Mitra, Hannah Kim, Estevam Hruschka
  • Challenging Large Language Models with New Tasks: A Study on their Adaptability and Robustness
    CHENXI LI, Yuanhe Tian, Zhaxi Zerong, Yan Song, Fei Xia
  • LLMs Beyond English: Scaling the Multilingual Capability of LLMs with Cross-Lingual Feedback
    Wen Lai, Mohsen Mesgar, Alexander Fraser
  • BASS: Batched Attention-optimized Speculative Sampling
    Haifeng Qian, Sujan Kumar Gonugondla, Sungsoo Ha, Mingyue Shang, Sanjay Krishna Gouda, Ramesh Nallapati, Sudipta Sengupta, Xiaofei Ma, Anoop Deoras
  • Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games
    Dekun Wu, Haochen Shi, Zhiyuan Sun, Bang Liu
  • It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension
    Sagi Shaier, Lawrence Hunter, Katharina von der Wense
  • Large Language Models Relearn Removed Concepts
    Michelle Wai Man Lo, Fazl Barez, Shay B Cohen
  • Towards Unified Task Embeddings Across Multiple Models: Bridging the Gap for Prompt-Based Large Language Models and Beyond
    Xinyu Wang, Hainiu Xu, Lin Gui, Yulan He
  • TOAD: Task-Oriented Automatic Dialogs with Diverse Response Styles
    Yinhong Liu, Yimai Fang, David Vandyke, Nigel Collier
  • Machine-Generated Text Localization
    Zhongping Zhang, Wenda Qin, Bryan A. Plummer
  • BenchIE^FL: A Manually Re-Annotated Fact-Based Open Information Extraction Benchmark
    Fabrice Lamarche, Philippe Langlais
  • CausalCite: A Causal Formulation of Paper Citations
    Ishan Kumar Agrawal, Zhijing Jin, Ehsan Mokhtarian, Siyuan Guo, Yuen Chen, Mrinmaya Sachan, Bernhard Schölkopf
  • Question Translation Training for Better Multilingual Reasoning
    Wenhao Zhu, Shujian Huang, Fei Yuan, Shuaijie She, Jiajun Chen, Alexandra Birch
  • Improving LLM Generations via Fine-Grained Self-Endorsement
    Ante Wang, Linfeng Song, Baolin Peng, Lifeng Jin, Ye Tian, Haitao Mi, Jinsong Su, Dong Yu
  • Multi-Label Classification for Implicit Discourse Relation Recognition
    Wanqiu Long, Siddharth N, Bonnie Webber
  • StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code
    Hannah McLean Babe, Sydney Nguyen, Yangtian Zi, Arjun Guha, Molly Q Feldman, Carolyn Jane Anderson
  • ProLex: A Benchmark for Language Proficiency-oriented Lexical Substitution
    Xuanming Zhang, Zixun Chen, Zhou Yu
  • Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding
    Yuu Jinnai, Ukyo Honda, Tetsuro Morimura, Peinan Zhang
  • GATE X-E : A Challenge Set for Gender-Fair Translations from Weakly-Gendered Languages
    Spencer Rarrick, Ranjita Naik, Sundar Poudel, Vishal Chowdhary
  • Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding
    Yuu Jinnai, Kaito Ariu
  • Simplifying Translations for Children: Iterative Simplification Considering Age of Acquisition with LLMs
    Masashi Oshika, Makoto Morishita, Tsutomu Hirao, Ryohei Sasano, Koichi Takeda
  • Bi-Chainer: Automated Large Language Models Reasoning with Bidirectional Chaining
    Shuqi LIU, Bowei He, Linqi Song
  • Can Large Language Model Summarizers Adapt to Diverse Scientific Communication Goals?
    Marcio Fonseca, Shay B Cohen
  • Knowledge Context Modeling with Pre-trained Language Models for Contrastive Knowledge Graph Completion
    Guangqian Yang, Yi Liu, Lei Zhang, Licheng Zhang, Hongtao Xie, Zhendong Mao
  • Stronger, Lighter, Better: Towards Life-Long Attribute Value Extraction for E-Commerce Products
    TAO ZHANG, Chenwei Zhang, Xian Li, Jingbo Shang, Hoang H Nguyen, Philip S. Yu
  • Generalized Category Discovery with Large Language Models in the Loop
    Wenbin An, Wenkai Shi, Feng Tian, Haonan Lin, QianYing Wang, Yaqiang Wu, mingxiang cai, Luyan Wang, Yan Chen, Haiping Zhu, Ping Chen
  • VAEGPT-Sim: Improving Sentence Representation with Limited Corpus Using Gradually-Denoising VAE
    Zhenyi Wang, Haiyan Ning, Qing Ling, Dan Wang
  • PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
    Yiduo Guo, Zekai Zhang, Yaobo Liang, Dongyan Zhao, Nan Duan
  • Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models
    Xinran Zhao, Hongming Zhang, Xiaoman Pan, Wenlin Yao, Dong Yu, Tongshuang Wu, Jianshu Chen
  • DB-LLM: Accurate Dual-Binarization for Efficient LLMs
    Hong Chen, Chengtao Lv, Liang Ding, Haotong Qin, Xiabin Zhou, Yifu Ding, Xuebo Liu, Min Zhang, Jinyang Guo, Xianglong Liu, Dacheng Tao
  • TempCompass: Do Video LLMs Really Understand Videos?
    Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou
  • Teaching Large Language Models an Unseen Language on the Fly
    Chen Zhang, Xiao Liu, Jiuheng Lin, Yansong Feng
  • Error Analysis Prompting Enables Human-Like Translation Evaluation in Large Language Models
    Qingyu Lu, Baopu Qiu, Liang Ding, Kanjian Zhang, Tom Kocmi, Dacheng Tao
  • DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation
    Jiapeng Wang, Chengyu Wang, Tingfeng Cao, Jun Huang, Lianwen Jin
  • Rationales for Answers to Simple Math Word Problems Confuse Large Language Models
    Yidan Zhang, Mingfeng Xue, Dayiheng Liu, Zhenan He
  • ResLoRA: Identity Residual Mapping in Low-Rank Adaption
    Shuhua Shi, Shaohan Huang, Minghui Song, Zhoujun Li, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang
  • Towards Objectively Benchmarking Social Intelligence of Language Agents at the Action Level
    Chenxu Wang, Bin Dai, Huaping Liu, Baoyuan Wang
  • Semantic Role Labeling from Chinese Speech via End-to-End Learning
    Huiyao Chen, Xinxin Li, Meishan Zhang, Min Zhang
  • MEEL: Multi-Modal Event Evolution Learning
    Zhengwei Tao, Zhi Jin, Junqiang Huang, Xiancai Chen, Xiaoying Bai, Yifan Zhang, Chongyang Tao
  • LLM-REDIAL: A Large-Scale Dataset for Conversational Recommender Systems Created from User Behaviors with LLMs
    Tingting Liang, Chenxin Jin, Lingzhi Wang, Wenqi Fan, Congying Xia, Kai Chen, Yuyu Yin
  • Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative Models
    Mahammed Kamruzzaman, Md. Minul Islam Shovon, Gene Louis Kim
  • EVIT: Event-Oriented Instruction Tuning for Event Reasoning
    Zhengwei Tao, Xiancai Chen, Zhi Jin, Xiaoying Bai, Haiyan Zhao, Yiwei Lou
  • InstructCMP: Length Control in Sentence Compression through Instruction-based Large Language Models
    Juseon-Do, Hidetaka Kamigaito, Manabu Okumura, Jingun Kwon
  • SymTax: Symbiotic Relationship and Taxonomy Fusion for Effective Citation Recommendation
    Karan Goyal, Mayank Goel, Vikram Goyal, Mukesh Mohania
  • Assessing News Thumbnail Representativeness: Counterfactual text can enhance the cross-modal matching ability
    Yejun Yoon, Seunghyun Yoon, Kunwoo Park
  • Towards Better Question Generation in QA-based Event Extraction
    Zijin Hong, Jian Liu
  • Budget-Constrained Tool Learning with Planning
    Yuanhang Zheng, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu
  • TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild
    Huayang Li, Siheng Li, Deng Cai, Longyue Wang, Lemao Liu, Taro Watanabe, Yujiu Yang, Shuming Shi
  • The Critique of Critique
    Shichao Sun, Junlong Li, Weizhe Yuan, Ruifeng Yuan, Wenjie Li, Pengfei Liu
  • CoCo-Agent: A Comprehensive Cognitive MLLM Agent for Smartphone GUI Automation
    Xinbei Ma, Zhuosheng Zhang, hai zhao
  • FRVA: Fact-Retrieval and Verification Augmented Entailment Tree Generation for Explainable Question Answering
    Yue Fan, Hu zhang, Ru Li, YuJie Wang, Hongye Tan, Jiye Liang
  • P4: Plug-and-Play Discrete Prompting for Large Language Models Personalization
    Yuansen Zhang, Xiao Wang, Tianze Chen, Jiayi Fu, Tao Gui, Qi Zhang
  • RRNorm: A Novel Framework for Chinese Disease Diagnoses Normalization via LLM-Driven Terminology Component Recognition and Reconstruction
    Yongqi Fan, yansha zhu, KUI XUE, Jingping Liu, Tong Ruan
  • Unexpected Phenomenon: LLMs’ Spurious Associations in Information Extraction
    Weiyan Zhang, Wanpeng Lu, Jiacheng Wang, Yating Wang, Lihan Chen, Haiyun Jiang, Jingping Liu, Tong Ruan
  • AutoCAP: Towards Automatic Cross-lingual Alignment Planning for Zero-shot Chain-of-Thought
    Yongheng Zhang, Qiguang Chen, Min Li, Wanxiang Che, Libo Qin
  • LCS: A Language Converter Strategy for Zero-Shot Neural Machine Translation
    Zengkui Sun, Yijin Liu, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou
  • Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data
    Xiao Liu, Zirui Wu, Xueqing Wu, Pan Lu, Kai-Wei Chang, Yansong Feng
  • On the Vulnerability of Safety Alignment in Open-Access LLMs
    Jingwei Yi, Rui Ye, Qisi Chen, Bin Benjamin Zhu, Siheng Chen, Defu Lian, Guangzhong Sun, Xing Xie, Fangzhao Wu
  • PEK: A Parameter-Efficient Framework for Knowledge-Grounded Dialogue Generation
    Pan Yang, Dandan Song, Zhijing Wu, Yanru Zhou
  • Outdated Issue Aware Decoding for Factual Knowledge Editing
    Zengkui Sun, Yijin Liu, Jiaan Wang, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou
  • Disentangling Dialect from Social Bias via Multitask Learning to Improve Fairness
    Maximilian Spliethöver, Sai Nikhil Menon, Henning Wachsmuth
  • DP-MLM: Differentially Private Text Rewriting Using Masked Language Models
    Stephen Meisenbacher, Maulik Chevli, Juraj Vladika, Florian Matthes
  • Question-Instructed Visual Descriptions for Zero-Shot Video Answering
    David Orlando Romero Mogrovejo, Thamar Solorio
  • EX-FEVER: A Dataset for Multi-hop Explainable Fact Verification
    Huanhuan Ma, Weizhi Xu, Yifan Wei, Liuji Chen, Liang Wang, Qiang Liu, Shu Wu, Liang Wang
  • Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
    Zehui Chen, Kuikun Liu, Qiuchen Wang, Wenwei Zhang, Jiangning Liu, Dahua Lin, Kai Chen, Feng Zhao
  • Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification
    Ekaterina Fadeeva, Aleksandr Rubashevskii, Artem Shelmanov, Sergey Petrakov, Haonan Li, Hamdy Mubarak, Evgenii Tsymbalov, Gleb Kuzmin, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov
  • Deciphering the Impact of Pretraining Data on Large Language Models through Machine Unlearning
    yang zhao, Li Du, Xiao Ding, Kai Xiong, Zhouhao Sun, Shi jun, Ting Liu, Bing Qin
  • Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning
    Everlyn Asiko Chimoto, Jay Gala, Orevaoghene Ahia, Julia Kreutzer, Bruce Bassett, Sara Hooker
  • What Are You Token About? Differentiable Perturbed Top-$k$ Token Selection for Scientific Document Summarization
    Luca Ragazzi, Paolo Italiani, Gianluca Moro, Mattia Panni
  • Description Boosting for Zero-Shot Entity and Relation Classification
    Gabriele Picco, Leopold Fuchs, Marcos Martínez Galindo, Alberto Purpura, Vanessa López, Hoang Thanh Lam
  • Domain-Aware $k$-Nearest-Neighbor Knowledge Distillation for Machine Translation
    Zhexuan Wang, Shudong Liu, Xuebo Liu, Miao Zhang, Derek F. Wong, Min Zhang
  • Beyond Single-Event Extraction: Towards Efficient Document-Level Multi-Event Argument Extraction
    Wanlong Liu, Li Zhou, DingYi Zeng, Yichen Xiao, Shaohuan Cheng, Chen Zhang, Grandee Lee, Malu Zhang, Wenyu Chen
  • Revisiting Interpolation Augmentation for Speech-to-Text Generation
    Chen Xu, Jie Wang, Xiaoqian Liu, Qian qian Dong, Chunliang Zhang, Tong Xiao, JingBo Zhu, Dapeng Man, Wu Yang
  • Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
    Dennis Thomas Ulmer, Elman Mansimov, Kaixiang Lin, Lijia Sun, Xibin Gao, Yi Zhang
  • Semantic are Beacons: A Semantic Perspective for Unveiling Parameter-Efficient Fine-Tuning in Knowledge Learning
    Renzhi Wang, Piji Li
  • Leveraging Collection-Wide Similarities for Unsupervised Document Structure Extraction
    Gili Lior, Yoav Goldberg, Gabriel Stanovsky
  • Enhancing Cross Text-Molecule Learning by Self-Augmentation
    Jiang Yinuo, Xiang Zhuang, Keyan Ding, Qiang Zhang, Huajun Chen
  • RePALM: Popular Quote Tweet Generation via Auto-Response Augmentation
    Erxin Yu, Jing Li, Chunpu Xu
  • On the Effect of (Near) Duplicate Subwords in Language Modelling
    Anton Schäfer, Thomas Hofmann, Imanol Schlag, Tiago Pimentel
  • Do Pre-Trained Language Models Detect and Understand Semantic Underspecification? Ask the DUST!
    Frank Wildenburg, Michael Hanna, Sandro Pezzelle
  • Visual Hallucinations of Multi-modal Large Language Models
    Wen Huang, Hongbin Liu, Minxin Guo, Neil Zhenqiang Gong
  • SumSurvey: An Abstractive Dataset of Scientific Survey Papers for Long Document Summarization
    Ran Liu, Ming Liu, Min Yu, He Zhang, Jianguo Jiang, Gang Li, Weiqing Huang
  • Understanding and Patching Compositional Reasoning in LLMs
    Zhaoyi Li, Gangwei Jiang, Hong Xie, Linqi Song, Defu Lian, Ying Wei
  • Bilingual Rhetorical Structure Parsing with Large Parallel Annotations
    Elena Chistova
  • Book2Dial: Generating Teacher Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots
    Junling Wang, Jakub Macina, Nico Daheim, Sankalan Pal Chowdhury, Mrinmaya Sachan
  • SELP: A Semantically-Driven Approach for Separated and Accurate Class Prototypes in Few-Shot Text Classification
    Wenxin Liang, Tingyu Zhang, Han Liu, Feng Zhang
  • Automated Focused Feedback Generation for Scientific Writing Assistance
    Eric Chamoun, Michael Sejr Schlichtkrull, Andreas Vlachos
  • FastGAS: Fast Graph-based Annotation Selection for In-Context Learning
    Zihan Chen, Song Wang, Cong Shen, Jundong Li
  • Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations
    Bowen Shen, Zheng Lin, Daren Zha, Wei Liu, Jian Luan, Bin Wang, Weiping Wang
  • Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability
    Afra Feyza Akyürek, Ekin Akyürek, Leshem Choshen, Derry Wijaya, Jacob Andreas
  • Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion
    Ruiqi Li, Rongjie Huang, Yongqi Wang, Zhiqing Hong, Zhou Zhao
  • Evaluating Large Language Model Biases in Persona-Steered Generation
    Andy Liu, Mona T. Diab, Daniel Fried
  • Leveraging Entity Information for Cross-Modality Correlation Learning: The Entity-Guided Multimodal Summarization
    Yanghai Zhang, Ye Liu, Shiwei Wu, Kai Zhang, Xukai Liu, Qi Liu, Enhong Chen
  • CR-UTP: Certified Robustness against Universal Text Perturbations on Large Language Models
    Qian Lou, Xin Liang, Jiaqi Xue, Yancheng Zhang, Rui Xie, Mengxin Zheng
  • Recovering document annotations for sentence-level bitext
    Rachel Wicks, Matt Post, Philipp Koehn
  • MetaPro 2.0: Computational Metaphor Processing on the Effectiveness of Anomalous Language Modeling
    Rui Mao, Kai He, Claudia Beth Ong, Qian Liu, Erik Cambria
  • Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling
    Shenzhi Wang, Chang Liu, Zilong Zheng, Siyuan Qi, Shuo Chen, Qisen Yang, Andrew Zhao, Chaofei Wang, Shiji Song, Gao Huang
  • Direct Preference Optimization with an Offset
    Afra Amini, Tim Vieira, Ryan Cotterell
  • TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation
    Xize Cheng, Rongjie Huang, Linjun Li, Zehan Wang, Tao Jin, Aoxiong Yin, Chen Feiyang, Xinyu Duan, Baoxing Huai, Zhou Zhao
  • More than Minorities and Majorities: Understanding Multilateral Bias in Language Generation
    Jiaxu Zhao, Zijing Shi, Yitong Li, Yulong Pei, Ling Chen, Meng Fang, Mykola Pechenizkiy
  • Fair Federated Learning with Biased Vision-Language Models
    Huimin Zeng, Zhenrui Yue, Yang Zhang, Lanyu Shang, Dong Wang
  • SpeechGuard: Exploring the Adversarial Robustness of Multi-modal Large Language Models
    Raghuveer Peri, Sai Muralidhar Jayanthi, Srikanth Ronanki, Anshu Bhatia, Karel Mundnich, Saket Dingliwal, Nilaksh Das, Zejiang Hou, Goeric Huybrechts, Srikanth Vishnubhotla, Daniel Garcia-Romero, Sundararajan Srinivasan, Kyu J. Han, Katrin Kirchhoff
  • ACUEval: Fine-grained Hallucination Evaluation and Correction for Abstractive Summarization
    David Wan, Koustuv Sinha, Srini Iyer, Asli Celikyilmaz, Mohit Bansal, Ramakanth Pasunuru
  • An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models
    Xiongtao Zhou, Jie He, Yuhua Ke, Guangyao Zhu, Victor Gutierrez Basulto, Jeff Z. Pan
  • PARADISE: Evaluating Implicit Planning Skills of Language Models with Procedural Warnings and Tips Dataset
    Arda Uzunoglu, Gözde Gül Şahin, Abdulfattah Safa
  • TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation
    Gökçe Uludoğan, Zeynep Yirmibeşoğlu Balal, Furkan Akkurt, Meliksah Turker, Onur Gungor, Susan Üsküdarlı
  • From Discrimination to Generation: Low-Resource Intent Detection with Language Model Instruction Tuning
    Feng Zhang, Wei Chen, Fei Ding, Meng Gao, Tengjiao Wang, Jiahui Yao, Jiabin Zheng
  • Efficient Continual Pre-training for Building Domain Specific Large Language Models
    Yong Xie, Karan Aggarwal, Aitzaz Ahmad
  • Distantly-Supervised Joint Extraction with Noise-Robust Learning
    Yufei Li, Xiao Yu, Yanghong Guo, Yanchi Liu, Haifeng Chen, Cong Liu
  • LLM Factoscope: Uncovering LLMs’ Factual Discernment through Measuring Inner States
    Jinwen He, Yujia Gong, Zijin Lin, Cheng’an Wei, Yue Zhao, Kai Chen
  • DictLLM: Harnessing Key-Value Data Structures with Large Language Models for Enhanced Medical Diagnostics
    YiQiu Guo, Yuchen Yang, Ya Zhang, Yu Wang, Yanfeng Wang
  • imapScore: Medical Fact Evaluation Made Easy
    Huimin WANG, Yutian Zhao, Xian Wu, Yefeng Zheng
  • Making Harmful Behaviors Unlearnable for Large Language Models
    Xin Zhou, Yi Lu, Ruotian Ma, Yujian Wei, Tao Gui, Qi Zhang, Xuanjing Huang
  • Debiasing Large Language Models with Structured Knowledge
    Congda MA, Tianyu Zhao, Manabu Okumura
  • Contrastive Instruction Tuning
    Tianyi Yan, Fei Wang, James Y. Huang, Wenxuan Zhou, Fan Yin, Aram Galstyan, Wenpeng Yin, Muhao Chen
  • Bootstrapped Pre-training with Dynamic Identifier Prediction for Generative Retrieval
    Yubao Tang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng
  • Refining and Synthesis: A Simple yet Effective Data Augmentation Framework for Cross-Domain Aspect-based Sentiment Analysis
    Haining Wang, Kang He, Bobo Li, Lei Chen, Fei Li, Xu Han, Chong Teng, Donghong Ji
  • Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
    Haibin Wu, Ho-Lam Chung, Yi-Cheng Lin, Yuan-Kuei Wu, Xuanjun Chen, Yu-Chi Pai, Hsiu-Hsuan Wang, Kai-Wei Chang, Alexander H. Liu, Hung-yi Lee
  • CACL: Community-Aware Heterogeneous Graph Contrastive Learning for Social Media Bot Detection
    Sirry Chen, Shuo Feng, Liang Songsong, Chen-Chen Zong, Jing Li, Piji Li
  • Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification
    Soumya Sanyal, Tianyi Xiao, Jiacheng Liu, Wenya Wang, Xiang Ren
  • ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning
    Ahmed Masry, Mehrad Shahmohammadi, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty
  • Improving Multilingual Neural Machine Translation by Utilizing Semantic and Linguistic Features
    Mengyu Bu, Shuhao Gu, Yang Feng
  • Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts
    Ganesh Jawahar, Haichuan Yang, Yunyang Xiong, Zechun Liu, Dilin Wang, Fei Sun, Meng Li, Aasish Pappu, Barlas Oguz, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Raghuraman Krishnamoorthi, Vikas Chandra
  • SharedCon: Implicit Hate Speech Detection using Shared Semantics
    Hyeseon Ahn, Youngwook Kim, Jungin Kim, Yo-Sub Han
  • Smaller Language Models are capable of selecting Instruction-Tuning Training Data for Larger Language Models
    Dheeraj Mekala, Alex Nguyen, Jingbo Shang
  • InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents
    Qiusi Zhan, Zhixiang Liang, Zifan Ying, Daniel Kang
  • Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tuning
    Xiaohu Du, Ming Wen, Jiahao Zhu, Zifan Xie, Bin Ji, Huijun Liu, Xuanhua Shi, Hai Jin
  • PPTSER: A Plug-and-Play Tag-guided Method for Few-shot Semantic Entity Recognition on Visually-rich Documents
    Wenhui Liao, Jiapeng Wang, Zening Lin, Longfei Xiong, Lianwen Jin
  • LLM Performance Predictors are good initializers for Architecture Search
    Ganesh Jawahar, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Dujian Ding
  • MODDP: A Multi-modal Open-domain Chinese Dataset for Dialogue Discourse Parsing
    Chen Gong, DeXin Kong, Suxian Zhao, Xingyu Li, Guohong Fu
  • Chinese MentalBERT: Domain-Adaptive Pre-training on Social Media for Chinese Mental Health Text Analysis
    Wei Zhai, Hongzhi Qi, Qing Zhao, Jianqiang Li, Ziqi Wang, Han Wang, Bing Xiang Yang, Guanghui FU
  • Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
    Zhanhui Zhou, Jie Liu, Jing Shao, Xiangyu Yue, Chao Yang, Wanli Ouyang, Yu Qiao
  • DORY: Deliberative Prompt Recovery for LLM
    Lirong Gao, Ru Peng, Yiming Zhang, Junbo Zhao
  • STYLE: Improving Domain Transferability of Asking Clarification Questions in Large Language Model Powered Conversational Agents
    Yue Chen, Chen Huang, Yang Deng, Wenqiang Lei, Dingnan Jin, Jia Liu, Tat-Seng Chua
  • Evaluating Robustness of Generative Search Engine on Adversarial Factoid Questions
    Xuming Hu, Xiaochuan Li, Junzhe Chen, Yinghui Li, Yangning Li, Xiaoguang Li, Yasheng Wang, Qun Liu, Lijie Wen, Philip S. Yu, Zhijiang Guo
  • Automatic Engineering of Long Prompts
    Cho-Jui Hsieh, Si Si, Felix Yu, Inderjit S Dhillon
  • AS-ES Learning: Towards efficient CoT learning in small models
    Nuwa Xi, Yuhan Chen, Sendong Zhao, Haochun Wang, GongZhang, Bing Qin, Ting Liu
  • II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering
    Jihyung Kil, Farideh Tavazoee, Dongyeop Kang, Joo-Kyung Kim
  • TAME-RD: Text Assisted Replication of Image Multi-Adjustments for Reverse Designing
    Pooja Guhan, Uttaran Bhattacharya, Somdeb Sarkhel, Vahid Azizi, Xiang Chen, Saayan Mitra, Aniket Bera, Dinesh Manocha
  • Batch-ICL: Effective, Efficient, and Order-Agnostic In-Context Learning
    Kaiyi Zhang, Ang Lv, Yuhan Chen, Hansen Ha, Tao XU, Rui Yan
  • IndicVoices: Towards building an Inclusive Multilingual Speech Dataset for Indian Languages
    Tahir Javed, Janki Atul Nawale, Eldho Ittan George, Sakshi Joshi, Kaushal Santosh Bhogale, Deovrat Mehendale, Ishvinder Virender Sethi, Aparna Ananthanarayanan, Hafsah Faquih, Pratiti Palit, Sneha Ravishankar, Saranya Sukumaran, Tripura Panchagnula, Sunjay Murali, Kunal Sharad Gandhi, Ambujavalli R, Manickam K M, C Venkata Vaijayanthi, Krishnan Srinivasa Raghavan Karunganni, Pratyush Kumar, Mitesh M Khapra
  • ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models
    Kaiwen Zhou, Kwonjoon Lee, Teruhisa Misu, Xin Eric Wang
  • Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm
    Yuanzhen Xie, Xinzhou Jin, Tao Xie, matrixmxlin, Liang Chen, Chenyun Yu, Cheng lei, Chengxiang Zhuo, Bo Hu, Zang Li
  • Unveiling Opinion Evolution via Prompting and Diffusion for Short Video Fake News Detection
    Linlin Zong, Jiahui Zhou, Wenmin Lin, Xinyue Liu, Xianchao Zhang, Bo Xu
  • iSign: A Benchmark for Indian Sign Language Processing
    Abhinav Joshi, Romit Mohanty, Mounika Kanakanti, Andesha Mangla, Sudeep Choudhary, Monali Barbate, Ashutosh Modi
  • Data Contamination Calibration for Black-box LLMs
    Wentao Ye, Jiaqi Hu, Liyao Li, Haobo Wang, Gang Chen, Junbo Zhao
  • Truth-Aware Context Selection: Mitigating Hallucinations of Large Language Models Being Misled by Untruthful Contexts
    Tian Yu, Shaolei Zhang, Yang Feng
  • Efficiently Exploring Large Language Models for Document-Level Machine Translation with In-context Learning
    Menglong Cui, Jiangcun Du, shaolin Zhu, Deyi Xiong
  • Improving Grammatical Error Correction via Contextual Data Augmentation
    Yixuan Wang, Baoxin Wang, Yijun Liu, Qingfu Zhu, Dayong Wu, Wanxiang Che
  • RECOST: External Knowledge Guided Data-efficient Instruction Tuning
    Qi Zhang, Yiming Zhang, Haobo Wang, Junbo Zhao
  • Understanding Cross-Lingual Alignment—A Survey
    Katharina Hämmerl, Jindřich Libovický, Alexander Fraser
  • Mitigate Negative Transfer with Similarity Heuristic Lifelong Prompt Tuning
    Chenyuan Wu, Gangwei Jiang, Defu Lian
  • PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs
    An Liu, Zonghan Yang, Zhenhe Zhang, Qingyuan Hu, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu
  • Developing PUGG for Polish: A Modern Approach to KBQA, MRC, and IR Dataset Construction
    Albert Sawczyn, Katsiaryna Viarenich, Konrad Wojtasik, Aleksandra Domogała, Marcin Oleksy, Maciej Piasecki, Tomasz Jan Kajdanowicz
  • Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM
    Zijin Hong, Zheng Yuan, Hao Chen, Qinggang Zhang, Feiran Huang, Xiao Huang
  • Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration
    Han Cheng Yu, Yu An Shih, Kin Man Law, KaiYu Hsieh, Yu Chen Cheng, Hsin Chih Ho, Zih An Lin, WEN-CHUAN HSU, Yao-Chung Fan
  • Exploiting Positional Bias for Query-Agnostic Generative Content in Search
    Andrew Parry, Sean MacAvaney, Debasis Ganguly
  • ICC : Quantifying Image Caption Concreteness for Multimodal Dataset Curation
    Moran Yanuka, Morris Alper, Hadar Averbuch-Elor, Raja Giryes
  • On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey
    Lin Long, Rui Wang, Ruixuan Xiao, Junbo Zhao, Xiao Ding, Gang Chen, Haobo Wang
  • Accelerating Multilingual Language Model for Excessively Tokenized Languages
    Jimin Hong, Gibbeum Lee, Jaewoong Cho
  • Distillation Enhanced Generative Retrieval
    Yongqi Li, Zhen Zhang, Wenjie Wang, Liqiang Nie, Wenjie Li, Tat-Seng Chua
  • ToxVidLM: A Multimodal Framework for Toxicity Detection in Code-Mixed Videos
    Krishanu Maity, A.S. Poornash, Sriparna Saha, Pushpak Bhattacharyya
  • StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
    Zhicheng Guo, Sijie Cheng, Hao Wang, Shihao Liang, Yujia Qin, Peng Li, Zhiyuan Liu, Maosong Sun, Yang Liu
  • Both Matter: Enhancing the Emotional Intelligence of Large Language Models without Compromising the General Intelligence
    Weixiang Zhao, Zhuojun Li, Shilong Wang, Yang Wang, Yulin Hu, Yanyan Zhao, Chen Wei, Bing Qin
  • KorNAT: LLM Alignment Benchmark for Korean Social Values and Common Knowledge
    Jiyoung Lee, Minwoo Kim, Seungho Kim, Junghwan Kim, Seunghyun Won, Hwaran Lee, Edward Choi
  • Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development
    Pranab Sahoo, Ayush Kumar Singh, Sriparna Saha, Aman Chadha, Samrat Mondal
  • Space Decomposition for Sentence Embedding
    Wuttikorn Ponwitayarat, Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Sarana Nutanong
  • Improving Low-Resource Machine Translation for Formosan Languages Using Bilingual Lexical Resources
    Francis Zheng, Edison Marrese-Taylor, Yutaka Matsuo
  • CMMLU: Measuring massive multitask language understanding in Chinese
    Haonan Li, Yixuan Zhang, Fajri Koto, Yifei Yang, hai zhao, Yeyun Gong, Nan Duan, Timothy Baldwin
  • Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation
    Seongyun Lee, Seungone Kim, Sue Hyun Park, Geewook Kim, Minjoon Seo
  • Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction
    Xiaoyuan Li, Wenjie Wang, Moxin Li, Junrong Guo, Yang Zhang, Fuli Feng
  • Less is KEN: a Universal and Simple Non-Parametric Pruning Algorithm for Large Language Models
    Michele Mastromattei, Fabio Massimo Zanzotto
  • When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation
    Shiyu Ni, Keping Bi, Jiafeng Guo, Xueqi Cheng
  • Hybrid Alignment Training for Large Language Models
    Chenglong Wang, Hang Zhou, Kaiyan Chang, Bei Li, Yongyu Mu, Tong Xiao, Tongran Liu, JingBo Zhu
  • Graph-Structured Speculative Decoding
    Zhuocheng Gong, Jiahao Liu, Ziyue Wang, Pengfei Wu, Jingang Wang, Xunliang Cai, Dongyan Zhao, Rui Yan
  • Duwak: Dual Watermarks in Large Language Models
    Chaoyi Zhu, Jeroen M. Galjaard, Pin-Yu Chen, Lydia Y. Chen
  • CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion
    Qibing Ren, Chang Gao, Jing Shao, Junchi Yan, Xin Tan, Wai Lam, Lizhuang Ma
  • Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training
    Qingyan Guo, Rui Wang, Junliang Guo, Xu Tan, Jiang Bian, Yujiu Yang
  • wav2vec-S: Adapting Pre-trained Speech Models for Streaming
    Biao Fu, Kai Fan, Minpeng Liao, Yidong Chen, Xiaodong Shi, Zhongqiang Huang
  • Peering into the Mind of Language Models: An Approach for Attribution in Contextual Question Answering
    Anirudh Phukan, Shwetha S, Apoorv Saxena, Koustava Goswami, Balaji Vasan Srinivasan
  • TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification
    Martin Gubri, Dennis Thomas Ulmer, Hwaran Lee, Sangdoo Yun, Seong Joon Oh
  • CLASP: Cross-modal Alignment Using Pre-trained Unimodal Models
    Jianing Zhou, Ziheng Zeng, Hongyu Gong, Suma Bhat
  • TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models’ Theory-of-Mind
    Guiyang Hou, Wenqi Zhang, Yongliang Shen, Linjuan Wu, Weiming Lu
  • Identifying and Mitigating Annotation Bias in Natural Language Understanding using Causal Mediation Analysis
    Sitiporn Sae Lim, Can Udomcharoenchaikit, Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Sarana Nutanong
  • Perturbed examples reveal invariances shared by language models
    Ruchit Rawal, Mariya Toneva
  • Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
    Yiwei Li, Fei Mi, Yitong Li, Yasheng Wang, Bin Sun, Shaoxiong Feng, Kan Li
  • Discourse Structure-Aware Prefix for Generation-Based End-to-End Argumentation Mining
    Yang Sun, Guanrong Chen, Caihuayang, Jianzhu Bao, Bin Liang, Xi Zeng, Min Yang, Ruifeng Xu
  • Poor-Supervised Evaluation for SuperLLM via Mutual Consistency
    Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Boyuan Pan, Heda Wang, Yao Hu, Kan Li
  • Addressing Entity Translation Problem via Translation Difficulty and Context Diversity
    Tian Liang, Xing Wang, Mingming Yang, Yujiu Yang, Shuming Shi, Zhaopeng Tu
  • ADAM: Dense Retrieval Distillation with Adaptive Dark Examples
    Chongyang Tao, Chang Liu, Tao Shen, Can Xu, Xiubo Geng, Binxing Jiao, Daxin Jiang
  • Instruction Position Matters in Sequence Generation with Large Language Models
    Yijin Liu, Xianfeng Zeng, Chenze Shao, Fandong Meng, Jie Zhou
  • XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection
    Yuanhang Yang, Shiyi Qi, Wenchao Gu, Chaozheng Wang, Cuiyun Gao, Zenglin Xu
  • BranchNorm: Robustly Scaling Extremely Deep Transformers
    Yijin Liu, Xianfeng Zeng, Fandong Meng, Jie Zhou
  • MusTQ: A Temporal Knowledge Graph Question Answering Dataset for Multi-Step Temporal Reasoning
    Tingyi Zhang, Jiaan Wang, Zhixu Li, Jianfeng Qu, An Liu, Zhigang Chen, Hongping Zhi
  • Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models
    Anthony Sicilia, Hyunwoo Kim, Khyathi Chandu, Malihe Alikhani, Jack Hessel
  • Knowledge Fusion By Evolving Weights of Language Models
    Guodong DU, Jing Li, Hanting Liu, Runhua Jiang, Shuyang Yu, Yifei Guo, Sim Kuan Goh, Ho-Kin Tang
  • ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by Learning to Scale
    Markus Frohmann, Carolin Holtermann, Shahed Masoudian, Anne Lauscher, Navid Rekabsaz
  • Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models
    Chang-Sheng Kao, Yun-Nung Chen
  • MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization
    Zhiyu Yang, Zihan Zhou, Shuo Wang, Xin Cong, Xu Han, Yukun Yan, Zhenghao Liu, Zhixing Tan, Pengyuan Liu, Dong Yu, Zhiyuan Liu, Xiaodong Shi, Maosong Sun
  • Continual Few-shot Relation Extraction via Adaptive Gradient Correction and Knowledge Decomposition
    hu jianpeng, Chengxiang Tan, JiaCheng Xu, XiangyunKong
  • CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models
    Linhao Yu, Yongqi Leng, Yufei Huang, Shang Wu, Haixin Liu, Xinmeng Ji, Jiahui Zhao, Jinwang Song, Tingting Cui, Xiaoqing Cheng, Liutao, Deyi Xiong
  • Cache & Distil: Optimising API Calls to Large Language Models
    Guillem Ramírez, Matthias Lindemann, Alexandra Birch, Ivan Titov
  • Investigating the Impact of Model Instability on Explanations and Uncertainty
    Sara Vera Marjanovic, Isabelle Augenstein, Christina Lioma
  • A Two-Stage Adaptation of Large Language Models for Text Ranking
    Longhui Zhang, Yanzhao Zhang, Dingkun Long, Pengjun Xie, Meishan Zhang, Min Zhang
  • Fine-tuning with HED-IT: The impact of human post-editing for dialogical language models
    Daniela Occhipinti, Michele Marchi, Irene Mondella, Huiyuan Lai, Felice Dell’Orletta, Malvina Nissim, Marco Guerini
  • Analyze, Generate and Refine: Query Expansion with LLMs for Zero-Shot Open-Domain QA
    Xinran Chen, Xuanang Chen, Ben He, Tengfei Wen, Le Sun
  • On the Evaluation of Speech Foundation Models for Spoken Language Understanding
    Siddhant Arora, Ankita Pasad, Chung-Ming Chien, Jionghao Han, Roshan Sharma, Jee-weon Jung, Hira Dhamyal, William Chen, Suwon Shon, Hung-yi Lee, Karen Livescu, Shinji Watanabe
  • Towards Multiple References Era – Addressing Data Leakage and Limited Reference Diversity in Machine Translation Evaluation
    Xianfeng Zeng, Yijin Liu, Fandong Meng, Jie Zhou
  • Prompting open-source and commercial language models for grammatical error correction of English learner text
    Christopher Davis, Andrew Caines, O Andersen, Shiva Taslimipoor, Helen Yannakoudakis, Zheng Yuan, Christopher Bryant, Marek Rei, Paula Buttery
  • BATS: BenchmArking Text Simplicity 🦇
    Christin Katharina Kreutz, Fabian Haak, Björn Engelmann, Philipp Schaer
  • Discovering influential text using convolutional neural networks
    Megan Ayers, Luke Sanford, Margaret Roberts, Eddie Yang
  • Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models
    Yihong Dong, Xue Jiang, Huanyu Liu, Zhi Jin, Bin Gu, Mengfei Yang, Ge Li
  • Efficient Training of Language Models with Compact and Consistent Next Token Distributions
    Ashutosh Sathe, Sunita Sarawagi
  • Ancient Chinese Glyph Identification Powered by Radical Semantics
    Yang Chi, Fausto Giunchiglia, Chuntao Li, Hao Xu
  • PUB: A Pragmatics Understanding Benchmark for Assessing LLMs’ Pragmatics Capabilities
    Settaluri Lakshmi Sravanthi, Meet Doshi, Pavan Kalyan Tankala, Rudra Murthy, Raj Dabre, Pushpak Bhattacharyya
  • EmoTransKG: An Innovative Emotion Knowledge Graph to Reveal Emotion Transformation
    Huan Zhao, Xupeng Zha, Zixing Zhang
  • How Vocabulary Sharing Facilitates Multilingualism in LLaMA?
    Fei Yuan, Shuai Yuan, Zhiyong Wu, Lei Li
  • Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model
    Runzhe Zhan, Xinyi Yang, Derek F. Wong, Lidia S. Chao, Yue Zhang
  • Dual Prompt Tuning based Contrastive Learning for Hierarchical Text Classification
    Sishi Xiong, Yu Zhao, Jie Zhang, Li Mengxiang, Zhongjiang He, Xuelong Li, Shuangyong Song
  • Probing the Emergence of Cross-lingual Alignment during LLM Training
    Hetong Wang, Pasquale Minervini, Edoardo Ponti
  • STSPL-SSC: Semi-Supervised Few-Shot Short Text Clustering with Semantic text similarity Optimized Pseudo-Labels
    Wenhua Nie, Lin Deng, Chang-Bo Liu, JialingWei, Ruitong Han, Haoran Zheng
  • A Comprehensive Evaluation of Quantization Strategies for Large Language Models
    Renren Jin, Jiangcun Du, Wuwei Huang, Wei Liu, Jian Luan, Bin Wang, Deyi Xiong
  • Exploiting Target Language Data for Neural Machine Translation Beyond Back Translation
    Abudurexiti Reheman, yingfeng luo, Junhao Ruan, Chunliang Zhang, Anxiang Ma, Tong Xiao, JingBo Zhu
  • Bayesian Prompt Ensembles: Model Uncertainty Estimation for Black-Box Large Language Models
    Francesco Tonolini, Nikolaos Aletras, Jordan Massiah, Gabriella Kazai
  • X-ACE: Explainable and Multi-factor Audio Captioning Evaluation
    Qian Wang, Jia-Chen Gu, Zhen-Hua Ling
  • Reasons to Reject? Aligning Language Models with Judgments
    Weiwen Xu, Deng Cai, Zhisong Zhang, Wai Lam, Shuming Shi
  • Decomposing Argumentative Essay Generation via Dialectical Planning of Complex Reasoning
    Yuhang He, Jianzhu Bao, Yang Sun, Bin Liang, Min Yang, Bing Qin, Ruifeng Xu
  • Large Language Models are Few-Shot Training Example Generators: A Case Study in Fallacy Recognition
    Tariq Alhindi, Smaranda Muresan, Preslav Nakov
  • Concept-aware Data Construction Improves In-context Learning of Language Models
    Michal Štefánik, Marek Kadlčík, Petr Sojka
  • Non-Autoregressive Machine Translation as Constrained HMM
    Haoran Li, Zhanming Jie, Wei Lu
  • Multi-modal Stance Detection: New Datasets and Model
    Bin Liang, Ang Li, Jingqian Zhao, Lin Gui, Min Yang, Yue Yu, Kam-Fai Wong, Ruifeng Xu
  • Enhanced Language Model Truthfulness with Learnable Intervention and Uncertainty Expression
    Farima Fatahi Bayat, Xin Liu, H. V. Jagadish, Lu Wang
  • MM-LLMs: Recent Advances in MultiModal Large Language Models
    Duzhen Zhang, Yahan Yu, Jiahua Dong, Chenxing Li, Dan Su, Chenhui Chu, Dong Yu
  • CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models
    Yizhi LI, Ge Zhang, Xingwei Qu, Jiali Li, ZHAOQUN LI, Noah Wang, Hao Li, Ruibin Yuan, Yinghao Ma, Kai Zhang, Wangchunshu Zhou, Yiming Liang, Lei Zhang, Lei Ma, Jiajun Zhang, Zuowen Li, Wenhao Huang, Chenghua Lin, Jie Fu
  • Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning
    Mathieu Rita, Florian Strub, Rahma Chaabouni, Paul Michel, Emmanuel Dupoux, Olivier Pietquin
  • Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss
    Wei He, Marco Idiart, Carolina Scarton, Aline Villavicencio
  • AdaLomo: Low-memory Optimization with Adaptive Learning Rate
    Kai Lv, Hang Yan, Qipeng Guo, haijun Lv, Xipeng Qiu
  • Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks
    Wenyue Hua, Jiang Guo, Mingwen Dong, Henghui Zhu, Patrick Ng, Zhiguo Wang
  • Exciting Mood Changes: A Time-aware Hierarchical Transformer for Change Detection Modelling
    Anthony Hills, Talia Tseriotou, Xenia Miscouridou, Adam Tsakalidis, Maria Liakata
  • CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation
    Xiwen Liang, Liang Ma, Shanshan Guo, Jianhua Han, Hang Xu, Shikui Ma, Xiaodan Liang
  • SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval
    Siwei Wu, Yizhi LI, Kang Zhu, Ge Zhang, Yiming Liang, Kaijing Ma, Chenghao Xiao, Haoran Zhang, Bohao Yang, Wenhu Chen, Wenhao Huang, Noura Al Moubayed, Jie Fu, Chenghua Lin
  • Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation
    Nihal V. Nayak, Yiyang Nan, Avi Trost, Stephen Bach
  • Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning
    Anirudh Som, Karan Sikka, Helen Gent, Ajay Divakaran, Andreas Kathol, Dimitra Vergyri
  • Paying Attention to Deflections: Mining Pragmatic Nuances for Whataboutism Detection in Online Discourse
    Khiem Dinh Phi, Noushin Salek Faramarzi, Chenlu Wang, Ritwik Banerjee
  • Epistemology of Language Models: Do Language Models Have Holistic Knowledge?
    Minsu Kim, James Thorne
  • Strong hallucinations from negation and how to fix them
    Swarnadeep Bhar, Nicholas Asher
  • LLMs as Narcissistic Evaluators: When Ego Inflates Evaluation Scores
    Yiqi Liu, Nafise Sadat Moosavi, Chenghua Lin
  • HelloFresh: LLM Evalutions on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits
    Tim Franzmeyer, Aleksandar Shtedritski, Samuel Albanie, Philip Torr, Joao F. Henriques, Jakob Nicolaus Foerster
  • Chaos with Keywords: Exposing Large Language Models Sycophancy to Misleading Keywords and Evaluating Defense Strategies
    Aswin RRV, Nemika Tyagi, Md Nayem Uddin, Neeraj Varshney, Chitta Baral
  • Empowering Large Language Models for Textual Data Augmentation
    Yichuan Li, Kaize Ding, Jianling Wang, Kyumin Lee
  • Choose Your Transformer: Improved Transferability Estimation of Transformer Models on Classification Tasks
    Lukas Garbaciauskas, Max Ploner, Alan Akbik
  • CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation
    I-Hung Hsu, Zifeng Wang, Long Le, Lesly Miculicich, Nanyun Peng, Chen-Yu Lee, Tomas Pfister
  • TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction
    Kuan-Hao Huang, I-Hung Hsu, Tanmay Parekh, Zhiyu Xie, Zixuan Zhang, Prem Natarajan, Kai-Wei Chang, Nanyun Peng, Heng Ji
  • OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
    Tianyu Zheng, Ge Zhang, Tianhao Shen, Xueling Liu, Bill Yuchen Lin, Jie Fu, Wenhu Chen, Xiang Yue
  • Measuring and Addressing Indexical Bias in Information Retrieval
    Caleb Ziems, William Barr Held, Jane Dwivedi-Yu, Diyi Yang
  • CIDAR: Culturally Relevant Instruction Dataset For Arabic
    Zaid Alyafeai, Khalid Almubarak, Ahmed Ashraf, Deema Alnuhait, Saied Alshahrani, Gubran A.Q. Abdulrahman, Gamil Ahmed, Qais Gawah, Zead Saleh, Mustafa Ghaleb, Yousef Ali, Maged S. Al-shaibani
  • RadGraph-XL: A Large-Scale Expert-Annotated Dataset for Entity and Relation Extraction from Radiology Reports
    Jean-Benoit Delbrouck, Pierre Joseph Marcel Chambon, Zhihong Chen, Maya Varma, Andrew Johnston, Louis Blankemeier, Dave Van Veen, Tan Bui, Steven Truong, Curtis Langlotz
  • SMART: Submodular Data Mixture Strategy for Instruction Tuning
    H S V N S Kowndinya Renduchintala, Sumit Bhatia, Ganesh Ramakrishnan
  • Selective “Selective Prediction”: Reducing Unnecessary Abstention in Vision-Language Reasoning
    Tejas Srinivasan, Jack Hessel, Tanmay Gupta, Bill Yuchen Lin, Yejin Choi, Jesse Thomason, Khyathi Chandu
  • Differentially Private Knowledge Distillation via Synthetic Text Generation
    James Flemings, Murali Annavaram
  • KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions
    Fangyuan Xu, Kyle Lo, Luca Soldaini, Bailey Kuehl, Eunsol Choi, David Wadden
  • XL-HeadTags: Leveraging Multimodal Retrieval Augmentation for the Multilingual Generation of News Headlines and Tags
    Faisal Tareque Shohan, Mir Tafseer Nayeem, Samsul Islam, Abu Ubaida Akash, Shafiq Joty
  • InFoBench: Evaluating Instruction Following Ability in Large Language Models
    Yiwei Qin, Kaiqiang Song, Yebowen Hu, Wenlin Yao, Sangwoo Cho, Xiaoyang Wang, Xuansheng Wu, Fei Liu, Pengfei Liu, Dong Yu
  • EcoRank: Budget-Constrained Text Re-ranking Using Large Language Models
    Muhammad Shihab Rashid, Jannat Ara Meem, Yue Dong, Vagelis Hristidis
  • FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
    Gagan Bhatia, El Moatez Billah Nagoudi, Hasan Cavusoglu, Muhammad Abdul-Mageed
  • Aligning Large Multimodal Models with Factually Augmented RLHF
    Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liangyan Gui, Yu-Xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell
  • The Art of Defending: A Systematic Evaluation and Analysis of LLM Defense Strategies on Safety and Over-Defensiveness
    Neeraj Varshney, Pavel Dolin, Agastya Seth, Chitta Baral
  • PAT-Questions: A Self-Updating Benchmark for Present-Anchored Temporal Question-Answering
    Jannat Ara Meem, Muhammad Shihab Rashid, Yue Dong, Vagelis Hristidis
  • $360^\circ$REA: Towards A Reusable Experience Accumulation with $360^\circ$ Assessment for Multi-Agent System
    Shen Gao, Hao Li, Zhengliang Shi, Chengrui Huang, Quan Tu, Shuo Shang, Zhiliang Tian, Minlie Huang
  • Extracting Polymer Nanocomposite Samples from Full-Length Documents
    Ghazal Khalighinejad, Defne Circi, L. Brinson, Bhuwan Dhingra
  • Leveraging LLM Reasoning Enhances Personalized Recommender Systems
    Alicia Y. Tsai, Adam Kraft, Long Jin, Chenwei Cai, Anahita Hosseini, Taibai Xu, Zemin Zhang, Lichan Hong, Ed H. Chi, Xinyang Yi
  • Toucan: Many-to-Many Translation for 150 African Language Pairs
    AbdelRahim A. Elmadany, Ife Adebara, Muhammad Abdul-Mageed
  • Evaluating Structural Generalization in Neural Machine Translation
    Ryoma Kumon, Daiki Matsuoka, Hitomi Yanaka
  • Figuratively Speaking: Authorship Attribution via Multi-Task Figurative Language Modeling
    Gregorios A Katsios, Ning Sa, Tomek Strzalkowski
  • CHAMP: A Competition-level Dataset for Fine-Grained Analyses of LLMs’ Mathematical Reasoning Capabilities
    Yujun Audrey Mao, Yoon Kim, Yilun Zhou
  • Improving Machine Translation with Large Language Models: A Preliminary Study with Cooperative Decoding
    Jiali Zeng, Fandong Meng, Yongjing Yin, Jie Zhou
  • Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition
    Yukiya Hono, Koh Mitsuda, Tianyu Zhao, Kentaro Mitsui, Toshiaki Wakatsuki, Kei Sawada
  • Proving membership in LLM pretraining data via data watermarks
    Johnny Wei, Ryan Yixiang Wang, Robin Jia
  • SecFormer: Fast and Accurate Privacy-Preserving Inference for Transformer Models via SMPC
    Jinglong Luo, Yehong Zhang, Zhuo Zhang, Jiaqi Zhang, Xin Mu, Hui Wang, Yue Yu, Zenglin Xu
  • Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications
    Junlin Wang, Tianyi Yang, Roy Xie, Bhuwan Dhingra
  • History-Aware Conversational Dense Retrieval
    Fengran Mo, Chen Qu, Kelong Mao, Tianyu Zhu, Zhan Su, Kaiyu Huang, Jian-Yun Nie
  • Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models
    Yikai Zhang, Qianyu He, Xintao Wang, Siyu Yuan, Jiaqing Liang, Yanghua Xiao
  • ZeroStance: Leveraging ChatGPT for Open-Domain Stance Detection via Dataset Generation
    Chenye Zhao, Yingjie Li, Cornelia Caragea, Yue Zhang
  • Boosting Zero-Shot Crosslingual Performance using LLM-Based Augmentations with Effective Data Selection
    Barah Fazili, Ashish Sunil Agrawal, Preethi Jyothi
  • Reinforcement Tuning for Detecting Stances and Debunking Rumors Jointly with Large Language Models
    Ruichao Yang, Wei Gao, Jing Ma, Hongzhan Lin, Bo Wang
  • Exploring the Potential of Dense Information in Multimodal Alignment
    Zhiyuan Fan, Zhihong Chen, Benyou Wang
  • InstructEval: Instruction-Tuned Text Evaluator from Human Preference
    Wenhao Wu, Wei Li, Xinyan Xiao, Jiachen Liu, Sujian Li
  • A Curious Case of Searching for the Correlation between Training Data and Adversarial Robustness of Transformer Textual Models
    Dang Cao Cuong, Dung D. Le, Thai Le
  • InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment
    Jianing Wang, Junda Wu, Yupeng Hou, Yao Liu, Ming Gao, Julian McAuley
  • RaDA: Retrieval-augmented Web Agent Planning with LLMs
    Minsoo Kim, Victor Bursztyn, Eunyee Koh, Shunan Guo, seung-won hwang
  • Competition-Level Problems are Effective LLM Evaluators
    Yiming Huang, Zhenghao Lin, Xiao Liu, Yeyun Gong, Shuai Lu, Fangyu Lei, Yaobo Liang, yelong shen, Chen Lin, Nan Duan, Weizhu Chen
  • Large Language Models for Automated Open-domain Scientific Hypotheses Discovery
    Zonglin Yang, Xinya Du, JUNXIAN LI, Jie Zheng, Soujanya Poria, Erik Cambria
  • GRADUAL: Granularity-aware Dual Prototype Learning for Better Few-Shot Relation Extraction
    Zhiming Li, Yuchen Lyu
  • Training a Better Chinese Spelling Correction Model via Prior-knowledge Guided Teacher
    Chi Wei, shaobin huang, Rongsheng Li, Naiyu Yan, Rui Wang
  • The Revolution of Multimodal Large Language Models: A Survey
    Davide Caffagni, Federico Cocchi, Luca Barsellotti, Nicholas Moratelli, Sara Sarto, Lorenzo Baraldi, Lorenzo Baraldi, Marcella Cornia, Rita Cucchiara
  • OOP: Object-Oriented Programming Evaluation Benchmark for Large Language Models
    Shuai Wang, Liang Ding, Li Shen, Yong Luo, Bo Du, Dacheng Tao
  • Code Needs Comments: Enhancing Code LLMs with Comment Augmentation
    Demin Song, Honglin Guo, Yunhua Zhou, Shuhao Xing, Yudong Wang, Zifan Song, Wenwei Zhang, Qipeng Guo, Hang Yan, Xipeng Qiu, Dahua Lin
  • Efficient Domain Adaptation for Non-Autoregressive Machine Translation
    WangJie You, Pei Guo, Juntao Li, Kehai Chen, Min Zhang
  • Exploring Reversal Mathematical Reasoning Ability for Large Language Models
    Pei Guo, WangJie You, Juntao Li, Yan Bowen, Min Zhang
  • A Unified Joint Approach with Topological Context Learning and Rule Augmentation for Knowledge Graph Completion
    Jingtao Guo, Chunxia Zhang, Lingxi Li, Xiaojun Xue, Zhendong Niu
  • FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation
    Tu Vu, Mohit Iyyer, Xuezhi Wang, Noah Constant, Jerry Wei, Jason Wei, Chris Tar, Yun-Hsuan Sung, Denny Zhou, Quoc V Le, Thang Luong
  • ROSE Doesn’t Do That: Boosting the Safety of Instruction-Tuned Large Language Models with Reverse Prompt Contrastive Decoding
    Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao
  • CR-LLM: A Dataset and Optimization for Concept Reasoning of Large Language Models
    Nianqi Li, Jingping Liu, Sihang Jiang, Haiyun Jiang, Yanghua Xiao, Jiaqing Liang, Zujie Liang, Feng Wei, Jinglei Chen, ZHENGHONG HAO, Bing Han
  • DATA-CUBE: Data Curriculum for Instruction-based Sentence Representation Learning
    Yingqian Min, Kun Zhou, Dawei Gao, Xin Zhao, He Hu, Yaliang Li
  • Combating Label Sparsity in Short Text Topic Modeling via Nearest Neighbor Augmentation
    Yang Lin, Xinyu Ma, Xin Gao, Ruiqing Li, Yasha Wang, Xu Chu
  • RefuteBench: Evaluating Refuting Instruction-Following for Large Language Models
    Jianhao Yan, Yun Luo, Yue Zhang
  • Complex Logical Query Answering by Calibrating Knowledge Graph Completion Models
    Changyi Xiao, Yixin Cao
  • Argument-Based Sentiment Analysis on Forward-Looking Statements
    Chin-Yi Lin, Chung-Chi Chen, Hen-Hsen Huang, Hsin-Hsi Chen
  • Paying More Attention to Source Context: Mitigating Unfaithful Translations from Large Language Model
    Hongbin Zhang, Kehai Chen, Xuefeng Bai, Yang Xiang, Min Zhang
  • Unveiling the Power of Integration: Block Diagram Summarization through Local-Global Fusion
    Shreyanshu Bhushan, Eun-Soo Jung, Minho Lee
  • MultiSQL: A Schema-Integrated Context-Dependent Text2SQL Dataset with Diverse SQL Operations
    Chunhui Li, Yifan Wang, Zhen Wu, Zhen Yu, Fei Zhao, Shujian Huang, Xinyu Dai
  • Towards Demonstration-Aware Large Language Models for Machine Translation
    Chen Li, Meishan Zhang, Xuebo Liu, Zhaocong Li, Derek F. Wong, Min Zhang
  • DADA: Distribution-Aware Domain Adaptation of PLMs for Information Retrieval
    Dohyeon Lee, Jongyoon Kim, seung-won hwang, Joonsuk Park
  • LLMs cannot find reasoning errors, but can correct them given the error location
    Gladys Tyen, Hassan Mansoor, Victor Carbune, Peter Chen, Tony Mak
  • Investigating the Impact of Data Contamination of Large Language Models in Text-to-SQL translation
    Federico Ranaldi, Elena Sofia Ruzzetti, Dario Onorati, Leonardo Ranaldi, Cristina Giannone, Andrea Favalli, Raniero Romagnoli, Fabio Massimo Zanzotto
  • ChartCheck: Explainable Fact-Checking over Real-World Chart Images
    Mubashara Akhtar, Nikesh Subedi, Vivek Gupta, Sahar Tahmasebi, Oana Cocarascu, Elena Simperl
  • CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling
    Chenhao Zhang, Renhao Li, Minghuan Tan, Min Yang, Jingwei Zhu, Di Yang, Jiahao Zhao, Guancheng Ye, Chengming Li, Xiping Hu
  • Tox-BART: Leveraging Toxicity Attributes for Explanation Generation of Implicit Hate Speech
    Neemesh Yadav, Sarah Masud, Vikram Goyal, Md Shad Akhtar, Tanmoy Chakraborty
  • TextGenSHAP: Scalable Post-Hoc Explanations in Text Generation with Long Documents
    James Enouen, Hootan Nakhost, Sayna Ebrahimi, Sercan O Arik, Yan Liu, Tomas Pfister
  • Balanced Data Sampling for Language Model Training with Clustering
    Yunfan Shao, Linyang Li, Zhaoye Fei, Hang Yan, Dahua Lin, Xipeng Qiu
  • Length Generalization of Causal Transformers without Position Encoding
    Jie Wang, Tao Ji, Yuanbin Wu, Hang Yan, Tao Gui, Qi Zhang, Xuanjing Huang, Xiaoling Wang
  • Unsupervised Sign Language Translation and Generation
    Zhengsheng Guo, Zhiwei He, Wenxiang Jiao, Xing Wang, Rui Wang, Kehai Chen, Zhaopeng Tu, Yong Xu, Min Zhang
  • Mitigating Data Scarcity in Semantic Parsing across Languages with the Multilingual Semantic Layer and its Dataset
    Abelardo Carlos Martinez Lorenzo, Pere-Lluís Huguet Cabot, Karim Ghonim, Lu Xu, Hee-Soo Choi, Alberte Fernández-Castro, Roberto Navigli
  • Efficient Sparse Attention needs Adaptive Token Release
    Chaoran zhang, Lixin Zou, Dan Luo, Xiangyang Luo, Zihao Li, Min Tang, Chenliang Li
  • Learning Fine-Grained Grounded Citations for Attributed Large Language Models
    Lei Huang, Xiaocheng Feng, Weitao Ma, Yuxuan Gu, Weihong Zhong, Xiachong Feng, Weijiang Yu, Weihua Peng, Duyu Tang, Dandan Tu, Bing Qin
  • ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget
    Riccardo Orlando, Pere-Lluís Huguet Cabot, Edoardo Barba, Roberto Navigli
  • Synergizing Large Language Models and Pre-Trained Smaller Models for Conversational Intent Discovery
    Jinggui Liang, Lizi Liao, Hao Fei, Jing Jiang
  • FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction
    Alessandro Scirè, Karim Ghonim, Roberto Navigli
  • Looking Right is Sometimes Right: Investigating the Capabilities of Decoder-only LLMs for Sequence Labeling
    David Dukić, Jan Snajder
  • mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans
    Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe
  • Dual-Stage Multi-Task Syntax-Oriented Pre-Training for Syntactically Controlled Paraphrase Generation
    Hongxu Liu, Xiaojie Wang, Jiashen Sun, Ke Zeng, Wan Guanglu
  • Demonstration Augmentation for Zero-shot In-context Learning
    Yi Su, Yunpeng Tai, Yixin Ji, Juntao Li, Yan Bowen, Min Zhang
  • Pushing the Limits of Zero-shot End-to-End Speech Translation
    Ioannis Tsiamas, Gerard I. Gállego, José A.R. Fonollosa, Marta R. Costa-jussà
  • NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models
    Ancheng Xu, Minghuan Tan, Lei Wang, Min Yang, Ruifeng Xu
  • Evaluating Large Language Models for Health-related Queries with Presuppositions
    Navreet Kaur, Monojit Choudhury, Danish Pruthi
  • Word Sense Linking: Disambiguating Outside the Sandbox
    Andrei Stefan Bejgu, Edoardo Barba, Luigi Procopio, Alberte Fernández-Castro, Roberto Navigli
  • Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks
    Verna Dankers, Ivan Titov
  • Towards Multi-Relational Multi-Hop Reasoning over Dense Temporal Knowledge Graphs
    Jian Liu, Zihe Liu, Xueqiang LYU, Peng Jin, Jinan Xu
  • Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language Models
    Weihang Su, Changyue Wang, Qingyao Ai, Yiran HU, Zhijing Wu, Yujia Zhou, Yiqun LIU
  • Progressive Tuning: Towards Generic Sentiment Abilities for Large Language Models
    Guiyang Hou, Yongliang Shen, Weiming Lu
  • Fooling the Textual Fooler via Randomizing Latent Representations
    Duy Cao Hoang, Nguyen Hung-Quang, Saurav Manchanda, Minlong Peng, Kok-Seng Wong, Khoa D Doan
  • FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models
    Kaixin Lan, Tao Fang, Derek F. Wong, Yabo Xu, Lidia S. Chao, Cecilia Guanfang Zhao
  • Amanda: Adaptively Modality-Balanced Domain Adaptation for Multimodal Emotion Recognition
    Xinxin Zhang, Jun Sun, Simin Hong, Taihao Li
  • MedREQAL: Examining Medical Knowledge Recall of Large Language Models via Question Answering
    Juraj Vladika, Phillip Schneider, Florian Matthes
  • Deepfake Defense: Constructing and Evaluating a Specialized Urdu Deepfake Audio Dataset
    Sheza Munir, Wassay Sajjad, Mukeet Raza, Emaan Mujahid Abbas, Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza
  • Recognizing Everything from All Modalities at Once: Grounded Multimodal Universal Information Extraction
    Meishan Zhang, Hao Fei, Bin Wang, Shengqiong Wu, Yixin Cao, Fei Li, Min Zhang
  • Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data
    Yanda Li, Chi Zhang, Gang Yu, Wanqi Yang, Zhibin Wang, BIN FU, Guosheng Lin, Chunhua Shen, Ling Chen, Yunchao Wei
  • Modeling Overregularization in Children with Small Language Models
    Akari Haga, Saku Sugawara, Akiyo Fukatsu, Miyu Oba, Hiroki Ouchi, Taro Watanabe, Yohei Oseki
  • Harnessing Large Language Models as Post-hoc Correctors
    Zhiqiang Zhong, Kuangyu Zhou, Davide Mottin
  • Debatrix: Multi-dimensional Debate Judge with Iterative Chronological Analysis Based on LLM
    Jingcong Liang, Rong Ye, Meng Han, Ruofei Lai, Xinyu Zhang, Xuanjing Huang, zhongyu wei
  • CycleAlign: Iterative Distillation from Black-box LLM to White-box Models for Better Human Alignment
    Jixiang Hong, Quan Tu, Changyu Chen, Gao Xing, Ji Zhang, Rui Yan
  • Towards a new research agenda for multimodal enterprise document understanding: What are we missing?
    Armineh Nourbakhsh, Sameena Shah, Carolyn Rose
  • CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems
    Amin Abolghasemi, Zhaochun Ren, Arian Askari, Mohammad Aliannejadi, Maarten de Rijke, Suzan Verberne
  • Measuring Retrieval Complexity in Question Answering Systems
    Matteo Gabburo, Nicolaas Paul Jedema, Siddhant Garg, Leonardo F. R. Ribeiro, Alessandro Moschitti
  • Combining Hierachical VAEs with LLMs for clinically meaningful timeline summarisation in social media
    Jiayu Song, Jenny Chim, Adam Tsakalidis, Julia Ive, Dana Atzil-Slonim, Maria Liakata
  • PIXAR: Auto-Regressive Language Modeling in Pixel Space
    Yintao Tai, Xiyang Liao, Alessandro Suglia, Antonio Vergari
  • Sparsity-Accelerated Training for Large Language Models
    Da Ma, Lu Chen, Pengyu Wang, Hongshen Xu, Hanqi Li, Liangtai Sun, Su Zhu, Shuai Fan, Kai Yu
  • Do Language Models Exhibit Human-like Structural Priming Effects?
    Jaap Jumelet, Willem Zuidema, Arabella Sinclair
  • RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
    Noah Wang, Z.Y. Peng, Haoran Que, Jiaheng Liu, Wangchunshu Zhou, Yuhan Wu, Hongcheng Guo, Ruitong Gan, Zehao Ni, Jian Yang, Man Zhang, Zhaoxiang Zhang, Wanli Ouyang, Ke Xu, Wenhao Huang, Jie Fu, Junran Peng
  • LangSuit·E: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments
    Zixia Jia, Mengmeng Wang, Baichen Tong, Song-Chun Zhu, Zilong Zheng
  • MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models
    Divyanshu Aggarwal, Ashutosh Sathe, Ishaan Watts, Sunayana Sitaram
  • MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts
    Xuxin Cheng, Zhihong Zhu, Xianwei Zhuang, Zhanpeng Chen, Zhiqi Huang, Yuexian Zou
  • Multi-Task Transfer Matters During Instruction-Tuning
    David Mueller, Mark Dredze, Nicholas Andrews
  • What Makes a Good Order of Examples in In-Context Learning
    Qi Guo, Leiyu Wang, Yidong Wang, Wei Ye, Shikun Zhang
  • BloomVQA: Assessing Hierarchical Multi-modal Comprehension
    Yunye Gong, Robik Singh Shrestha, Jared Claypoole, Michael Cogswell, Arijit Ray, Christopher Kanan, Ajay Divakaran
  • AttributionBench: How Hard is Automatic Attribution Evaluation?
    Yifei Li, Xiang Yue, Zeyi Liao, Huan Sun
  • Diffusion Guided Language Modeling
    Justin Lovelace, Varsha Kishore, Yiwei Chen, Kilian Q Weinberger
  • InstructEd: Soft-Instruction Tuning for Model Editing with Hops
    XiaoQi Han, Ru Li, Xiaoli Li, Jiye Liang, Zifang Zhang, Jeff Z. Pan
  • TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback
    Eunseop Yoon, Hee Suk Yoon, SooHwan Eom, Gunsoo Han, Daniel Wontae Nam, Daejin Jo, Kyoung-Woon On, Mark A. Hasegawa-Johnson, Sungwoong Kim, Chang D. Yoo
  • Found in the middle: Calibrating Positional Attention Bias Improves Long Context Utilization
    Cheng-Yu Hsieh, Yung-Sung Chuang, Chun-Liang Li, Zifeng Wang, Long Le, Abhishek Kumar, James R. Glass, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister
  • S3-DST: Structured Open-Domain Dialogue Segmentation and State Tracking in the Era of LLMs
    Sarkar Snigdha Sarathi Das, Chirag Shah, Mengting Wan, Jennifer Neville, Longqi Yang, Reid Andersen, Georg Buscher, Tara Safavi
  • Set the Clock: Temporal Alignment of Pretrained Language Models
    Bowen Zhao, Zander Brumbaugh, Yizhong Wang, Hannaneh Hajishirzi, Noah A. Smith
  • From One to Many: Expanding the Scope of Toxicity Mitigation in Language Models
    Beyza Ermis, Luiza Amador Pozzobon, Sara Hooker, Patrick Lewis
  • Here’s a Free Lunch: Sanitizing Backdoored Models with Model Merge
    Ansh Arora, Xuanli He, Maximilian Mozes, Srinibas Swain, Mark Dras, Qiongkai Xu
  • Enhancing Sentence Simplification in Portuguese: Leveraging Paraphrases, Context, and Linguistic Features
    ARTHUR MARIANO ROCHA DE AZEVEDO SCALERCIO, Maria José Bocorny Finatto, Aline Paes
  • How Far can 100 Samples Go? Unlocking Zero-Shot Translation with Tiny Multi-Parallel Data
    Di Wu, Shaomu Tan, Yan Meng, David Stap, Christof Monz
  • Toward Reliable Ad-hoc Scientific Information Extraction: A Case Study on Two Materials Dataset
    Satanu Ghosh, Neal R Brodnik, Carolina Frey, Collin Holgate, Tresa Pollock, Samantha Daly, Samuel Carton
  • Structural Optimization Ambiguity and Simplicity Bias in Unsupervised Neural Grammar Induction
    Jinwook Park, Kangil Kim
  • LMDX: Language Model-based Document Information Extraction and Localization
    Vincent Perot, Kai Kang, Florian Luisier, Guolong Su, Xiaoyu Sun, Ramya Sree Boppana, Zilong Wang, Zifeng Wang, Jiaqi Mu, Hao Zhang, Chen-Yu Lee, Nan Hua
  • DBQR-QA: A Question Answering Dataset on a Hybrid of Database Querying and Reasoning
    Rungsiman Nararatwong, Chung-Chi Chen, Natthawut Kertkeidkachorn, Hiroya Takamura, Ryutaro Ichise
  • NoteChat: A Dataset of Synthetic Patient-Physician Conversations Conditioned on Clinical Notes
    Junda Wang, Zonghai Yao, Zhichao Yang, Huixue Zhou, Rumeng Li, Xun Wang, Yucheng XU, hong yu
  • Model Editing at Scale leads to Gradual and Catastrophic Forgetting
    Akshat Gupta, Anurag Rao, Gopala Anumanchipalli
  • 3MVRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document Understanding
    Yihao Ding, Lorenzo Vaiani, Caren Han, Jean Lee, Paolo Garza, Josiah Poon, Luca Cagliero
  • Faithful Persona-based Conversational Dataset Generation with Large Language Models
    Pegah Jandaghi, Xianghai Sheng, Xinyi Bai, Jay Pujara, Hakim Sidahmed
  • Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning
    Zhiyang Xu, Chao Feng, Rulin Shao, Trevor Ashby, Ying Shen, Di Jin, Yu Cheng, Qifan Wang, Lifu Huang
  • Challenges to Evaluating the Generalization of Coreference Resolution Models: A Measurement Modeling Perspective
    Ian Porada, Alexandra Olteanu, Kaheer Suleman, Adam Trischler, Jackie CK Cheung
  • SAGA: A Participant-specific Examination of Story Alternatives and Goal Applicability for a Deeper Understanding of Complex Events
    Sai P Vallurupalli, Katrin Erk, Francis Ferraro
  • SLIDE: A Framework Integrating Small and Large Language Models for Open-Domain Dialogues Evaluation
    Kun Zhao, Bohao Yang, Chen Tang, Chenghua Lin, Liang Zhan
  • Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction Tuning
    Janghoon Han, Changho Lee, Joongbo Shin, Stanley Jungkyu Choi, Honglak Lee, Kyunghoon Bae
  • What Makes Language Models Good-enough?
    Daiki Asami, Saku Sugawara
  • Refining Corpora from a Model Calibration Perspective for Chinese Spelling Correction
    Dingyao Yu, Yang An, Wei Ye, xiongfeng xiao, Shaoguang Mao, Tao Ge, Shikun Zhang
  • CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples
    Jianrui Zhang, Mu Cai, Tengyang Xie, Yong Jae Lee
  • Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models
    Ran Xu, Hejie Cui, Yue Yu, Xuan Kan, Wenqi Shi, Yuchen Zhuang, May Dongmei Wang, Wei Jin, Joyce C. Ho, Carl Yang
  • Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation
    Min-Jae Hwang, Ilia Kulikov, Benjamin N Peloquin, Hongyu Gong, Peng-Jen Chen, Ann Lee
  • Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learning
    Yang Wu, Chenghao Wang, Ece Gumusel, Xiaozhong Liu
  • TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection
    Hui Liu, Wenya Wang, Haoru Li, Haoliang Li
  • Tailoring with Targeted Precision: Edit-Based Agents for Open-Domain Procedure Customization
    Yash Kumar Lal, Li Zhang, Faeze Brahman, Bodhisattwa Prasad Majumder, Peter Clark, Niket Tandon
  • A Meta-Learning Perspective on Transformers for Causal Language Modeling
    Xinbo Wu, Lav R. Varshney
  • PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs
    Rongzhi Zhang, Jiaming Shen, Tianqi Liu, Haorui Wang, Zhen Qin, feng han, Jialu Liu, Simon Baumgartner, Michael Bendersky, Chao Zhang
  • Small Language Models Need Strong Verifiers to Self-Correct Reasoning
    Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran, Jaekyeom Kim, Moontae Lee, Honglak Lee, Lu Wang
  • Hire a Linguist!: Learning Endangered Languages in LLMs with In-Context Linguistic Descriptions
    Kexun Zhang, Yee Man Choi, Zhenqiao Song, Taiqi He, William Yang Wang, Lei Li
  • From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation
    Ali Malik, Stephen Mayhew, Christopher J Piech, Klinton Bicknell
  • From Representational Harms to Quality-of-Service Harms: A Case Study on Llama 2 Safety Safeguards
    Khaoula Chehbouni, Megha Roshan, Emmanuel Ma, Futian Andrew Wei, Afaf Taik, Jackie CK Cheung, Golnoosh Farnadi
  • CToolEval: A Chinese Benchmark for LLM-Powered Agent Evaluation in Real-World API Interactions
    Zishan Guo, Yufei Huang, Deyi Xiong
  • Token Alignment via Character Matching for Subword Completion
    Ben Athiwaratkun, Shiqi Wang, Mingyue Shang, YUCHEN TIAN, Zijian Wang, Sujan Kumar Gonugondla, Sanjay Krishna Gouda, Robert Kwiatkowski, Ramesh Nallapati, Parminder Bhatia, Bing Xiang
  • emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
    Ziyang Ma, Zhisheng Zheng, Jiaxin Ye, Jinchao Li, Zhifu Gao, ShiLiang Zhang, Xie Chen
  • Language-Informed Beam Search Decoding for Multilingual Machine Translation
    Yilin Yang, Stefan Lee, Prasad Tadepalli
  • RA-LoRA: Rank-Adaptive Parameter-Efficient Fine-Tuning for Accurate 2-bit Quantized Large Language Models
    Minsoo Kim, Sihwa Lee, Wonyong Sung, Jungwook Choi
  • The PGNSC Benchmark: How Do We Predict Where Information Spreads?
    Alexander K Taylor, Wei Wang
  • STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language Models
    Shreyas Basavatia, Keerthiram Murugesan, Shivam Ratnakar
  • Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models
    Dohyun Lee, Daniel Rim, Minseok Choi, Jaegul Choo
  • Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding
    Xintong Wang, Jingheng Pan, Liang Ding, Chris Biemann
  • Fine-tuning Language Models for Joint Rewriting and Completion of Code with Potential Bugs
    Dingmin Wang, Jinman Zhao, Hengzhi Pei, Samson Tan, Sheng Zha
  • A Critical Study of What Code-LLMs (Do Not) Learn
    Abhinav Anand, Shweta Verma, Krishna Narasimhan, Mira Mezini
  • Visual In-Context Learning for Large Vision-Language Models
    Yucheng Zhou, Xiang Li, Qianning Wang, Jianbing Shen
  • SCALE: Synergized Collaboration of Asymmetric Language Translation Engines
    Xin Cheng, Xun Wang, Tao Ge, Si-Qing Chen, Furu Wei, Dongyan Zhao, Rui Yan
  • No perspective, no perception!! Perspective-aware Healthcare Answer Summarization
    Gauri Naik, Sharad Chandakacherla, Shweta Yadav, Md Shad Akhtar
  • Retrieval-Augmented Retrieval: Large Language Models are Strong Zero-Shot Retriever
    Tao Shen, Guodong Long, Xiubo Geng, Chongyang Tao, Yibin Lei, Tianyi Zhou, Michael Blumenstein, Daxin Jiang
  • A Survey on Predicting the Factuality and the Bias of News Media
    Preslav Nakov, Jisun An, Haewoon Kwak, Muhammad Arslan Manzoor, Zain Muhammad Mujahid, Husrev Taha Sencar
  • Semantic Compression for Word and Sentence Embeddings using Discrete Wavelet Transform
    Rana Salama, Abdou Youssef, Mona T. Diab
  • Improving Multi-hop Logical Reasoning in Knowledge Graphs with Context-Aware Query Representation Learning
    Jeonghoon Kim, Heesoo Jung, Hyeju Jang, Hogun Park
  • ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models
    Yuzhao Heng, Chunyuan Deng, Yitong Li, Yue Yu, Yinghao Li, Rongzhi Zhang, Chao Zhang
  • Defending LLMs against Jailbreaking Attacks via Backtranslation
    Yihan Wang, Zhouxing Shi, Andrew Bai, Cho-Jui Hsieh
  • A Large Collection of Model-generated Contradictory Responses for Consistency-aware Dialogue Systems
    Shiki Sato, Reina Akama, Jun Suzuki, Kentaro Inui
  • Exploring Reasoning Biases in Large Language Models Through Syllogism: Insights from the NeuBAROCO Dataset
    Kentaro Ozeki, Risako Ando, Takanobu Morishita, Hirohiko Abe, Koji Mineshima, Mitsuhiro Okada
  • Unveiling the Spectrum of Data Contamination in Language Model: A Survey from Detection to Remediation
    Chunyuan Deng, Yilun Zhao, Yuzhao Heng, Yitong Li, Jiannan Cao, Xiangru Tang, Arman Cohan
  • DIMSIM: Distilled Multilingual Critics for Indic Text Simplification
    Sneha Mondal, Ritika, Ashish Sunil Agrawal, Preethi Jyothi, Aravindan Raghuveer
  • MATTER: Memory-Augmented Transformer Using Heterogeneous Knowledge Sources
    Dongkyu Lee, Chandana Satya Prakash, Jack FitzGerald, Jens Lehmann
  • Ask LLMs Directly, “What shapes your bias?”: Measuring Social Bias in Large Language Models
    Jisu Shin, Hoyun Song, Huije Lee, Soyeong Jeong, Jong C. Park
  • Chain-of-History Reasoning for Temporal Knowledge Graph Forecasting
    Yuwei Xia, Ding Wang, Qiang Liu, Liang Wang, Shu Wu, Xiao-Yu Zhang
  • Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements
    Ming Li, Jiuhai Chen, Lichang Chen, Tianyi Zhou
  • Label-aware Hard Negative Sampling Strategies with Momentum Contrastive Learning for Implicit Hate Speech Detection
    Jaehoon Kim, Seungwan Jin, Sohyun Park, Someen Park, Kyungsik Han
  • Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
    Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Jiuxiang Gu, Tianyi Zhou
  • Selective Prompting Tuning for Personalized Conversations with LLMs
    Qiushi Huang, Xubo Liu, Tom Ko, Bo Wu, Wenwu Wang, Yu Zhang, Lilian Tang
  • Sowing the Wind, Reaping the Whirlwind: The Impact of Editing Language Models
    Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria
  • ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions
    Honglin Lin, Siyu Li, Guoshun Nan, Chaoyue Tang, Xueting Wang, Jingxin Xu, Rong Yankai, zhouzhili, Yutong Gao, Qimei Cui, Xiaofeng Tao
  • PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual Patterns
    Yew Ken Chia, Vernon Toh, Deepanway Ghosal, Lidong Bing, Soujanya Poria
  • How Do Moral Emotions Shape Political Participation? A Cross-Cultural Analysis of Online Petitions Using Language Models
    Jaehong Kim, Chaeyoon Jeong, Seongchan Park, Meeyoung Cha, Wonjae Lee
  • VillagerAgent: A Graph-Based Multi-Agent Framework for Coordinating Complex Task Dependencies in Minecraft
    Yubo Dong, Xukun Zhu, Zhengzhe Pan, Linchao Zhu, Yi Yang
  • CF-TCIR: A Compositor-Free Framework for Hierarchical Text-Conditioned Image Retrieval
    Yuchen Yang, Yu Wang, Yanfeng Wang
  • DMIN: A Discourse-specific Multi-granularity Integration Network for Conversational Aspect-based Sentiment Quadruple Analysis
    Peijie Huang, Xisheng Xiao, Yuhong Xu, Jiawei Chen
  • FragRel: Exploiting Fragment-level Relations in the External Memory of Large Language Models
    Xihang Yue, Linchao Zhu, Yi Yang
  • On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations
    Shiao Meng, Xuming Hu, Aiwei Liu, Fukun Ma, Yawen Yang, Shuang Li, Lijie Wen
  • RESEMO: A Benchmark Chinese Dataset for Studying Responsive Emotion from Social Media Content
    Bo Hu, Meng Zhang, Chenfei Xie, Yuanhe Tian, Yan Song, Zhendong Mao
  • EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records
    Jaehee Ryu, Seonhee Cho, Gyubok Lee, Edward Choi
  • RePair: Automated Program Repair with Process-based Feedback
    Yuze Zhao, Zhenya Huang, Yixiao Ma, Rui Li, Kai Zhang, Hao Jiang, Qi Liu, Linbo Zhu, Yu Su
  • Concise and Precise Context Compression for Tool-Using Language Models
    Yang Xu, Yunlong Feng, Honglin Mu, Yutai Hou, Yitong Li, Xinghao Wang, Wanjun Zhong, Zhongyang Li, Dandan Tu, Qingfu Zhu, Min Zhang, Wanxiang Che
  • MedDec: A Dataset for Extracting Medical Decisions from Discharge Summaries
    Mohamed Elgaar, Jiali Cheng, Nidhi Vakil, Hadi Amiri, Leo Anthony Celi

Short Papers

  • AFPQ: Asymmetric Floating Point Quantization for LLMs
    Yijia Zhang, Sicheng Zhang, Shijie Cao, DaYou Du, Jianyu Wei, Ting Cao, Ningyi Xu
  • A Grounded Preference Model for LLM Alignment
    Tahira Naseem, Guangxuan Xu, Sarathkrishna Swaminathan, Asaf Yehudai, Subhajit Chaudhury, Radu Florian, Ramón Fernandez Astudillo, Asim Munawar
  • How Important is a Language Model for Low-resource ASR?
    Zoey Liu, Nitin Venkateswaran, Eric Le Ferrand, Emily Prud’hommeaux
  • InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model
    Haogeng Liu, Quanzeng You, Yiqi Wang, Xiaotian Han, Bohan Zhai, Yongfei Liu, Wentao Chen, Yiren Jian, Yunzhe Tao, Jianbo Yuan, Ran He, Hongxia Yang
  • Effective In-Context Example Selection through Data Compression
    ZhongXiang Sun, Kepu Zhang, Haoyu Wang, Xiao Zhang, Jun Xu
  • Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data
    Haolong Li, Yu Ma, Yinqi Zhang, Chen Ye, Jie chen
  • Realistic Evaluation of Toxicity in Large Language Models
    Tinh Son Luong, Thanh-Thien Le, Linh Van Ngo, Thien Huu Nguyen
  • Learning Job Title Representation from Job Description Aggregation Network
    Napat Laosaengpha, Thanit Tativannarat, Chawan Piansaddhayanon, Attapol Rutherford, Ekapol Chuangsuwanich
  • Flexible Weight Tuning and Weight Fusion Strategies for Continual Named Entity Recognition
    Yahan Yu, Duzhen Zhang, Xiuyi Chen, Chenhui Chu
  • An Empirical Study on the Characteristics of Bias upon Context Length Variation for Bangla
    Jayanta Sadhu, Ayan Antik Khan, Abhik Bhattacharjee, Rifat Shahriyar
  • SPAGHETTI: Open-Domain Question Answering from Heterogeneous Data Sources with Retrieval and Semantic Parsing
    Heidi Chenyu Zhang, Sina Semnani, Farhad Ghassemi, Jialiang Xu, Shicheng Liu, Monica Lam
  • k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated Text
    Abe Bohan Hou, Jingyu Zhang, Yichen Wang, Daniel Khashabi, Tianxing He
  • ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation
    Jirayu Burapacheep, Ishan Gaur, Agam Bhatia, Tristan Thrush
  • Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers
    Tuo Zhang, Jinyue Yuan, Salman Avestimehr
  • A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism
    Brian Thompson, Mehak Preet Dhaliwal, Peter Frisch, Tobias Domhan, Marcello Federico
  • RankMean: Module-Level Importance Score for Merging Fine-tuned LLM Models
    Gabriel Jacob Perin, Xuxi Chen, Shusen Liu, Bhavya Kailkhura, Zhangyang Wang, Brian Gallagher
  • DEBATE: Devil’s Advocate-Based Assessment and Text Evaluation
    Alex G. Kim, Keonwoo Kim, Sangwon Yoon
  • SocialBench: Sociality Evaluation of Role-Playing Conversational Agents
    Hongzhan Chen, Hehong Chen, Ming Yan, Wenshen Xu, Gao Xing, Weizhou Shen, Xiaojun Quan, Chenliang Li, Ji Zhang, Fei Huang
  • From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications
    Yongqiang Ma, Lizhi Qing, Jiawei Liu, Yangyang Kang, Yue Zhang, Wei Lu, Xiaozhong Liu, Qikai Cheng
  • VISPool: Enhancing Transformer Encoders with Vector Visibility Graph Neural Networks
    Tuna Alikaşifoğlu, Arda Can Aras, Aykut Koc
  • Accurate and Nuanced Open-QA Evaluation Through Textual Entailment
    Peiran Yao, Denilson Barbosa
  • Dictionary-Aided Translation for Handling Multi-Word Expressions in Low-Resource Languages
    Antonios Dimakis, Stella Markantonatou, Antonios Anastasopoulos
  • Selective Prefix Tuning for Pre-trained Language Models
    Hongyi Zhang, Zuchao Li, Ping Wang, hai zhao
  • Towards Better Utilization of Multi-Reference Training Data for Chinese Grammatical Error Correction
    Yumeng Liu, Zhenghua Li, HaoChen Jiang, Bo Zhang, Chen Li, Ji Zhang
  • Concept-Best-Matching: Evaluating Compositionality In Emergent Communication
    Boaz Carmeli, Yonatan Belinkov, Ron Meir
  • Pro-Woman, Anti-Man? Identifying Gender Bias in Stance Detection
    Yingjie Li, Yue Zhang
  • Likelihood-based Mitigation of Evaluation Bias in Large Language Models
    Masanari Ohi, Masahiro Kaneko, Ryuto Koike, Mengsay Loem, Naoaki Okazaki
  • Aligning Speech Segments Beyond Pure Semantics
    Kevin Heffernan, Artyom Kozhevnikov, Loic Barrault, Alexandre Mourachko, Holger Schwenk
  • Improving In-Context Learning with Prediction Feedback for Sentiment Analysis
    Hongling Xu, Qianlong Wang, Yice Zhang, Min Yang, Xi Zeng, Bing Qin, Ruifeng Xu
  • MovieSum: An Abstractive Summarization Dataset for Movie Screenplays
    Rohit Saxena, Frank Keller
  • Context Length Extension via Generalized Extrapolation Scale
    Linhan Li, Zhang Huaping
  • Selectively Answering Visual Questions
    Julian Martin Eisenschlos, Hernán Maina, Guido Ivetta, Luciana Benotti
  • Semantics or spelling? Probing contextual word embeddings with orthographic noise
    Jacob A. Matthews, John R Starr, Marten Van Schijndel
  • Automated Detection and Analysis of Data Practices Using A Real-World Corpus
    Mukund Srinath, Pranav Narayanan Venkit, Maria Badillo, Florian Schaub, C. Lee Giles, Shomir Wilson
  • Incorporating Syntax and Lexical Knowledge to Multilingual Sentiment Classification on Large Language Models
    Hiroshi Kanayama, YANG ZHAO, Ran Iwamoto, Takuya Ohko
  • Which Information Matters? Dissecting Human-written Multi-document Summaries with Partial Information Decomposition
    Laura Mascarell, Yan LHomme, Majed El Helou
  • Evaluating Large Language Models on Wikipedia-Style Survey Generation
    Fan Gao, Hang Jiang, Rui Yang, Qingcheng Zeng, Jinghui Lu, Moritz Blum, Tianwei She, Yuang Jiang, Irene Li
  • Predicting Narratives of Climate Obstruction in Social Media Advertising
    Harri Rowlands, Gaku Morio, Dylan Tanner, Christopher D Manning
  • Model Editing by Standard Fine-Tuning
    Govind Krishnan Gangadhar, Karl Stratos
  • Mitigating Hallucinations in Large Vision-Language Models (LVLMs) via Language-Contrastive Decoding (LCD)
    Avshalom Manevich, Reut Tsarfaty
  • HeSum: a Novel Dataset for Abstractive Text Summarization in Hebrew
    Tzuf Paz-Argaman, Itai Mondshine, Asaf Achi Mordechai, Reut Tsarfaty
  • It is Simple Sometimes: A Study On Improving Aspect-Based Sentiment Analysis Performance
    Laura Cabello, Uchenna Akujuobi
  • RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering
    Zihan Zhang, Meng Fang, Ling Chen
  • LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation
    Ikuya Yamada, Ryokan Ri
  • Do Zombies Understand? A Choose-Your-Own-Adventure Exploration of Machine Cognition
    Ariel Goldstein, Gabriel Stanovsky
  • Self-Consistent Reasoning-based Aspect-Sentiment Quad Prediction with Extract-Then-Assign Strategy
    Jieyong Kim, Ryang Heo, Yongsik Seo, SeongKu Kang, Jinyoung Yeo, Dongha Lee
  • “My Answer is C”: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models
    Xinpeng Wang, Bolei Ma, Chengzhi Hu, Leon Weber-Genzel, Paul Röttger, Frauke Kreuter, Dirk Hovy, Barbara Plank
  • Building Bridges: A Dataset for Evaluating Gender-Fair Machine Translation into German
    Manuel Lardelli, Giuseppe Attanasio, Anne Lauscher
  • Prompt Chaining or Stepwise Prompt? Refinement in Text Summarization
    Shichao Sun, Ruifeng Yuan, Ziqiang Cao, Wenjie Li, Pengfei Liu
  • From Zero to Hero: Cold-Start Anomaly Detection
    Tal Reiss, George Kour, Naama Zwerdling, Ateret Anaby Tavor, Yedid Hoshen
  • LSTPrompt: Large Language Models as Zero-Shot Time Series Forecasters by Long-Short-Term Prompting
    haoxin liu, Zhiyuan Zhao, Jindong Wang, Harshavardhan Kamarthi, B. Aditya Prakash
  • The State of Relation Extraction Data Quality: Is Bigger Always Better?
    Erica Cai, Brendan O’Connor
  • Linear Cross-Lingual Mapping of Sentence Embeddings
    Oleg Vasilyev, Fumika Isono, John Bohannon
  • ULTRA: Unleash LLMs’ Potential for Event Argument Extraction through Hierarchical Modeling and Pair-wise Self-Refinement
    Xinliang Frederick Zhang, Carter Blum, Temma Choji, Shalin Shah, Alakananda Vempala
  • Exploring Domain Robust Lightweight Reward Models based on Router Mechanism
    Hyuk Namgoong, Jeesu Jung, Sangkeun Jung, YoonHyung Roh
  • “Get Their Hands Dirty, Not Mine’’: On Researcher-Annotator Collaboration and the Agency of Annotators
    Shengqi Zhu, Jeffrey Rzeszotarski
  • GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation
    Yi Zong, Xipeng Qiu
  • Revisiting Parallel Context Windows: A Frustratingly Simple Alternative and Chain-of-Thought Deterioration
    Kejuan Yang, Xiao Liu, Kaiwen Men, Aohan Zeng, Yuxiao Dong, Jie Tang
  • Large Language Models Can Learn Representation in Natural Language
    Yiduo Guo, Yaobo Liang, Dongyan Zhao, Nan Duan
  • CTC-based Non-autoregressive Textless Speech-to-Speech Translation
    Qingkai Fang, Zhengrui Ma, Yan Zhou, Min zhang, Yang Feng
  • Evidence Retrieval is almost All You Need for Fact Verification
    Liwen Zheng, Chaozhuo Li, Xi Zhang, Yu-Ming Shang, Feiran Huang, Haoran Jia
  • Pushing the Limits of Low-Resource NER Using LLM Artificial Data Generation
    Joan Santoso, Patrick Sutanto, Billy Kelvianto Cahyadi, Esther Irawati Setiawan
  • Integrating Multi-scale Contextualized Information for Byte-based Neural Machine Translation
    Langlin Huang, Yang Feng
  • MELD-ST: An Emotion-aware Speech Translation Dataset
    Sirou Chen, Sakiko Yahata, Shuichiro Shimizu, Zhengdong Yang, Yihang Li, Chenhui Chu, Sadao Kurohashi
  • Designing Informative Metrics for Few-Shot Example Selection
    Rishabh Adiga, Lakshmi Subramanian, Varun Chandrasekaran
  • Chain-of-Quizzes: Pedagogy-inspired Example Selection in In-Context-Learning
    Yiquan Wu, Anlai Zhou, Yuhang Liu, Yifei Liu, Adam Jatowt, Weiming Lu, Jun Xiao, Kun Kuang
  • It’s Not Easy Being Wrong: Large Language Models Struggle with Process of Elimination Reasoning
    Nishant Balepur, Shramay Palta, Rachel Rudinger
  • Centroid-Based Efficient Minimum Bayes Risk Decoding
    Hiroyuki Deguchi, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe, Hideki Tanaka, Masao Utiyama
  • When is a Language Process a Language Model?
    Li Du, Holden Lee, Jason Eisner, Ryan Cotterell
  • Definition Generation for Automatically Induced Semantic Frame
    Yi Han, Ryohei Sasano, Koichi Takeda
  • Don’t Augment, Rewrite? Assessing Abusive Language Detection with Synthetic Data
    Camilla Casula, Elisa Leonardelli, Sara Tonelli
  • AustroTox: A Dataset for Target-Based Austrian German Offensive Language Detection
    Pia Pachinger, Janis Goldzycher, Anna Maria Planitzer, Wojciech Kusa, Allan Hanbury, Julia Neidhardt
  • LC4EE: LLMs as Good Corrector for Event Extraction
    Mengna Zhu, Kaisheng Zeng, JibingWu, Lihua Liu, Hongbin Huang, Lei Hou, Juanzi Li
  • Beyond Text: Leveraging Multi-Task Learning and Cognitive Appraisal Theory for Post-Purchase Intention Analysis
    Gerard Christopher Yeo, Shaz Furniturewala, Kokil Jaidka
  • Diving Deep into the Motion Representation of Video-Text Models
    Chinmaya Devaraj, Cornelia Fermuller, Yiannis Aloimonos
  • Argument-Aware Approach To Event Linking
    I-Hung Hsu, Zihan Xue, Nilay Pochhi, Sahil Bansal, Prem Natarajan, Jayanth Srinivasa, Nanyun Peng
  • Understanding the Impacts of Language Technologies’ Performance Disparities on African American Language Speakers
    Jay L. Cunningham, Su Lin Blodgett, Michael A. Madaio, Hal Daumé III, Christina Harrington, Hanna Wallach
  • Language Model Priors and Data Augmentation Strategies for Low-resource Machine Translation: A Case Study Using Finnish to Northern Sámi
    Jonne Sälevä, Constantine Lignos
  • Few-shot Dialogue Strategy Learning for Motivational Interviewing via Inductive Reasoning
    Zhouhang Xie, Bodhisattwa Prasad Majumder, Mengjie Zhao, Yoshinori Maeda, Keiichi Yamada, Hiromi Wakaki, Julian McAuley
  • Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses
    Dongxu Zhang, Varun Prashant Gangal, Barrett Martin Lattimer, Yi Yang
  • Referral Augmentation for Zero-Shot Information Retrieval
    Michael Tang, Shunyu Yao, John Yang, Karthik R Narasimhan
  • Real World Conversational Entity Linking Requires More Than Zero-Shots
    Mohanna Hoveyda, Arjen P. de Vries, Faegheh Hasibi, Maarten de Rijke
  • Self-Para-Consistency: Improving Reasoning Tasks at Low Cost for Large Language Models
    Wenqing Chen, Weicheng Wang, Zhixuan Chu, Kui Ren, Zibin Zheng, Zhichao Lu
  • On The Persona-based Summarization of Domain-Specific Documents
    Ankan Mullick, Sombit Bose, Rounak Saha, Ayan Kumar Bhowmick, Pawan Goyal, Niloy Ganguly, Prasenjit Dey, Ravi Kokku
  • Part-of-speech Tagging for Extremely Low-resource Indian Languages
    Sanjeev Kumar, Preethi Jyothi, Pushpak Bhattacharyya
  • Leveraging Entailment Judgements in Cross-Lingual Summarisation
    Huajian Zhang, Laura Perez-Beltrachini
  • Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical Semantics
    Zhu Liu, Cunliang Kong, Ying Liu, Maosong Sun
  • Preemptive Answer “Attacks” on Chain-of-Thought Reasoning
    Rongwu Xu, Zehan Qi, Wei Xu
  • Views Are My Own, but Also Yours: Benchmarking Theory of Mind Using Common Ground
    Adil Soubki, John Murzaku, Arash Yousefi Jordehi, Peter Zeng, Magdalena Markowska, Seyed Abolghasem Mirroshandel, Owen Rambow
  • TAXI: Evaluating Categorical Knowledge Editing for Language Models
    Derek Powell, Walter Gerych, Thomas Hartvigsen
  • Automatic Bug Detection in LLM-Powered Text-Based Games Using LLMs
    Claire Jin, Sudha Rao, XIANGYU PENG, Portia Kwartema Botchway, Jessica Quaye, Chris Brockett, Bill Dolan
  • Embodied Language Learning: Opportunities, Challenges, and Future Directions
    Nadine Amin, Julia Rayz
  • Verifiable Generation with Subsentence-Level Fine-Grained Citations
    Shuyang Cao, Lu Wang
  • Rethinking Efficient Multilingual Text Summarization Meta-Evaluation
    Rilyn R. Han, Jiawen Chen, Yixin Liu, Arman Cohan
  • Are Decoder-Only Language Models Better than Encoder-Only Language Models in Understanding Word Meaning?
    Muhammad Reza Qorib, Geonsik Moon, Hwee Tou Ng
  • KEEP CHATTING! An Attractive Dataset for Continuous Conversation Agents
    Yihe Wang, Jin Liu, Yao Wan, Yitong Li, Zifeng Liu, Weipeng Chen