Long Papers
- Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attribute Manipulation
Letian Peng, Yuwei Zhang, Jingbo Shang
- Match More, Extract Better! Hybrid Matching Model for Open Domain Web Keyphrase Extraction
Mingyang Song, Liping Jing, Yi Feng
- End-to-End Emotion Semantic Parsing
Xiaotong Jiang, Zhongqing Wang, Guodong Zhou
- Overcoming Catastrophic Forgetting by Exemplar Selection in Task-oriented Dialogue System
Chen Chen, Ruizhe Li, Yuchen Hu, Yuanyuan Chen, Chengwei Qin, Qiang Zhang
- Unveiling Imitation Learning: Exploring the impact of Data Falsity to Large Language Model
Hyunsoo Cho
- The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations?
Alex Gu, Wen-Ding Li, Naman Jain, Theo X. Olausson, Celine Lee, Koushik Sen, Armando Solar-Lezama
- CHIME: LLM-Assisted Hierarchical Organization of Scientific Studies for Literature Review Support
Chao-Chun Hsu, Erin Bransom, Jenna Sparks, Bailey Kuehl, Chenhao Tan, David Wadden, Lucy Lu Wang, Aakanksha Naik
- Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation
Hao Li, Yuping Wu, Viktor Schlegel, Riza Batista-Navarro, Tharindu Madusanka, Iqra Zahid, Jiayan Zeng, Xiaochi Wang, Xinran He, Yizhi LI, Goran Nenadic
- Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs
Bowen Jin, Chulin Xie, Jiawei Zhang, Kashob Kumar Roy, Yu Zhang, Zheng Li, Ruirui Li, Xianfeng Tang, Suhang Wang, Yu Meng, Jiawei Han
- Text2DB: Integration-Aware Information Extraction with Large Language Model Agents
Yizhu Jiao, Sha Li, Sizhe Zhou, Heng Ji, Jiawei Han
- MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
Vithursan Thangarasa, Mahmoud Salem, Shreyas Saxena, Chen-Yu Kevin Leong, Joel Hestness, Sean Lie
- Lexicon-Level Contrastive Visual-Grounding Improves Language Modeling
Chengxu Zhuang, Evelina Fedorenko, Jacob Andreas
- P-TA: Using Proximal Policy Optimization to Enhance Tabular Data Augmentation via Large Language Models
Shuo Yang, Chenchen Yuan, Yao Rong, Felix Steinbauer, Gjergji Kasneci
- Teaching-Assistant-in-the-Loop: Improving Knowledge Distillation from Imperfect Teacher Models in Low-Budget Scenarios
Yuhang Zhou, Wei Ai
- Small Models are Valuable Plug-ins for Large Language Models
Canwen Xu, Yichong Xu, Shuohang Wang, Yang Liu, Chenguang Zhu, Julian McAuley
- Are self-explanations from Large Language Models faithful?
Andreas Madsen, Sarath Chandar, Siva Reddy
- ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction
Henry Peng Zou, Vinay Samuel, Yue Zhou, Weizhi Zhang, Liancheng Fang, Zihe Song, Philip S. Yu, Cornelia Caragea
- Prompt Engineering a Prompt Engineer
Qinyuan Ye, Mohamed Ahmed, Reid Pryzant, Fereshte Khani
- ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations
Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, S Sakshi, Sanjoy Chowdhury, Dinesh Manocha
- Tables as Texts or Images: Evaluating the Table Reasoning Ability of LLMs and MLLMs
Naihao Deng, Zhenjie Sun, Ruiqi He, Aman Sikka, Yulong Chen, Lin Ma, Yue Zhang, Rada Mihalcea
- Biasly: An Expert-Annotated Dataset for Subtle Misogyny Detection and Mitigation
Brooklyn Sheppard, Anna Richter, Allison Cohen, Elizabeth Allyn Smith, Tamara Kneese, Carolyne Pelletier, Ioana Baldini, Yue Dong
- BlendSQL: A Scalable Dialect for Unifying Hybrid Question Answering in Relational Algebra
Parker Glenn, Parag Pravin Dakle, Liang Wang, Preethi Raghavan
- LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
Zechun Liu, Barlas Oguz, Changsheng Zhao, Ernie Chang, Pierre Stock, Yashar Mehdad, Yangyang Shi, Raghuraman Krishnamoorthi, Vikas Chandra
- Towards Verifiable Generation: A Benchmark for Knowledge-aware Language Model Attribution
Xinze Li, Yixin Cao, Liangming Pan, Yubo Ma, Aixin Sun
- Benchmarking Cognitive Biases in Large Language Models as Evaluators
Ryan Koo, Minhwa Lee, Vipul Raheja, Jong Inn Park, Zae Myung Kim, Dongyeop Kang
- X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instructions
Chong Li, Wen Yang, Jiajun Zhang, Jinliang Lu, Shaonan Wang, Chengqing Zong
- Muffin: Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback
Jiashuo WANG, Chunpu Xu, Chak Tou Leong, Wenjie Li, Jing Li
- Resonance RoPE: Improving Context Length Generalization of Large Language Models
Suyuchen Wang, Ivan Kobyzev, Peng Lu, Mehdi Rezagholizadeh, Bang Liu
- MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning
Xiangru Tang, Anni Zou, Zhuosheng Zhang, Ziming Li, Yilun Zhao, Xingyao Zhang, Arman Cohan, Mark Gerstein
- Meta-Reasoning: Semantics-Symbol Deconstruction for Large Language Models
Yiming Wang, Zhuosheng Zhang, Pei Zhang, Baosong Yang, Rui Wang
- DPDLLM: A Black-box Framework for Detecting Pre-training Data from Large Language Models
Baohang Zhou, Zezhong WANG, Lingzhi Wang, Hongru WANG, Ying Zhang, Kehui Song, Xuhui Sui, Kam-Fai Wong
- PACIT: Unlocking the Power of Examples for Better In-Context Instruction Tuning
Tianci Xue, Ziqi Wang, Yixia Li, Yun Chen, Guanhua Chen
- Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models
Yuchen Hu, Chen Chen, Chengwei Qin, Qiushi Zhu, EngSiong Chng, Ruizhe Li
- Towards Better Graph-based Cross-document Relation Extraction via Non-bridge Entity Enhancement and Prediction Debiasing
Hao Yue, Shaopeng Lai, chengyiyang, Liang Zhang, Junfeng Yao, Jinsong Su
- Large Language Models can Share Images, Too!
Young-Jun Lee, Dokyong Lee, Joo Won Sung, Jonghwan Hyeon, Ho-Jin Choi
- CodeM: Less Data Yields More Versatility via Ability Matrix
Daoguang Zan, Ailun Yu, Wei Liu, Bo Shen, Shaoxin Lin, Yongshun Gong, Yafen Yao, Yan Liu, Bei Guan, Weihua Luo, Yongji Wang, Qianxiang Wang, Lizhen Cui
- Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning
Kung-Hsiang Huang, Mingyang Zhou, Hou Pong Chan, Yi Fung, Zhenhailong Wang, Lingyu Zhang, Shih-Fu Chang, Heng Ji
- BIDER: Bridging Knowledge Inconsistency for Efficient Retrieval-Augmented LLMs via Key Supporting Evidence
Jiajie Jin, Yutao Zhu, Yujia Zhou, Zhicheng Dou
- Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions
Wenxuan Wang, Yisi Zhang, Xingjian He, Yichen Yan, Zijia Zhao, Xinlong Wang, Jing Liu
- Incremental Sequence Labeling: A Tale of Two Shifts
Shengjie Qiu, Junhao Zheng, Zhen Liu, Yicheng Luo, Qianli Ma
- How Proficient Are Large Language Models in Formal Languages? An In-Depth Insight for Knowledge Base Question Answering
Jinxin Liu, Shulin Cao, Jiaxin Shi, Tingjian Zhang, Lunyiu Nie, Linmei Hu, Lei Hou, Juanzi Li
- MELOV: Multimodal Entity Linking with Optimized Visual Features in Latent Space
Xuhui Sui, Ying Zhang, Yu Zhao, Kehui Song, Baohang Zhou, Xiaojie Yuan
- Unsupervised Distractor Generation via Large Language Model Distilling and Counterfactual Contrastive Decoding
Fanyi Qu, Hao Sun, Yunfang Wu
- Conversational Question Answering with Language Models Generated Reformulations over Knowledge Graph
Lihui Liu, Blaine Hill, Boxin Du, Fei Wang, Hanghang Tong
- Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution Step by Step
Li Zhong, Zilong Wang, Jingbo Shang
- Are U a Joke Master? Pun Generation via Multi-Stage Curriculum Learning towards a Humor LLM
Yang Chen, Chong Yang, Tu Hu, Xinhao Chen, Man Lan, Li Cai, Xinlin Zhuang, Xuan Lin, Xin Lu, Aimin Zhou
- Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering
Yichi Zhang, Zhuo Chen, Yin Fang, Yanxi Lu, LI FANGMING, Wen Zhang, Huajun Chen
- MARIO: MAth Reasoning with code Interpreter Output - A Reproducible Pipeline
Minpeng Liao, Chengxi Li, Wei Luo, Wu Jing, Kai Fan
- DiffusPoll: Conditional Text Diffusion Model for Poll Generation
Le Cheng, Shuangyin Li
- Implanting LLM’s Knowledge via Reading Comprehension Tree for Toxicity Detection
Hankun Kang, Tieyun Qian
- LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
Zhuoshi Pan, Qianhui Wu, Huiqiang Jiang, Menglin Xia, Xufang Luo, Jue Zhang, Qingwei Lin, Victor Rühle, Yuqing Yang, Chin-Yew Lin, H. Vicky Zhao, Lili Qiu, Dongmei Zhang
- EconNLI: Evaluating Large Language Models on Economics Reasoning
Yue Guo, Yi Yang
- Better Late Than Never: Model-Agnostic Hallucination Post-Processing Framework Towards Clinical Text Summarization
Songda Li, Yunqi Zhang, Chunyuan Deng, Yake Niu, Hui Zhao
- Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers
Haowen Pan, Yixin Cao, Xiaozhi Wang, Xun Yang, Meng Wang
- Controllable Text Generation with Residual Memory Transformer
Hanqing Zhang, Si Sun, Haiming Wu, Dawei Song
- Prompt-Based Length Controlled Generation with Multiple Control Types
RENLONG JIE, Xiaojun Meng, Lifeng Shang, Xin Jiang, Qun Liu
- PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Xiangdi Meng, Tianyu Liu, Baobao Chang
- Pearl: A Review-driven Persona-Knowledge Grounded Conversational Recommendation Dataset
Minjin Kim, Minju Kim, Hana Kim, Beong-woo Kwak, SeongKu Kang, Youngjae Yu, Jinyoung Yeo, Dongha Lee
- CoLLaVO: Crayon Large Language and Vision mOdel
Byung-Kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro
- Modelling Variability in Human Annotator Simulation
Wen Wu, Wenlin Chen, Chao Zhang, Phil Woodland
- BEnQA: A Question Answering Benchmark for Bengali and English
Sheikh Shafayat, H M QUAMRAN HASAN, Minhajur Rahman Chowdhury Mahim, Rifki Afina Putri, James Thorne, Alice Oh
- MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning
Wanqing Cui, Keping Bi, Jiafeng Guo, Xueqi Cheng
- Cutting Off the Head Ends the Conflict: A Mechanism for Interpreting and Mitigating Knowledge Conflicts in Language Models
Zhuoran Jin, Pengfei Cao, Hongbang Yuan, Yubo Chen, Jiexin Xu, Huaijun Li, Xiaojian Jiang, Kang Liu, Jun Zhao
- BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task Tuning
Qizhi Pei, Lijun Wu, Kaiyuan Gao, Xiaozhuan Liang, Yin Fang, Jinhua Zhu, Shufang Xie, Tao Qin, Rui Yan
- SIBO: A Simple Booster for Parameter-Efficient Fine-Tuning
Zhihao Wen, Jie Zhang, Yuan Fang
- GeoEval: Benchmark for Evaluating LLMs and Multi-Modal Models on Geometry Problem-Solving
Jiaxin Zhang, Zhong-Zhi Li, Ming-Liang Zhang, Fei Yin, Cheng-Lin Liu, Yashar Moshfeghi
- Boosting Textural NER with Synthetic Image and Instructive Alignment
Jiahao Wang, Wenjun Ke, Peng Wang, Hang Zhang, Dong Nie, Jiajun Liu, Guozheng Li, Ziyu Shang
- Neurons in Large Language Models: Dead, N-gram, Positional
Elena Voita, Javier Ferrando, Christoforos Nalmpantis
- LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition
Jinyuan Li, Han Li, Di Sun, Jiahao Wang, Wenkun Zhang, Zan Wang, Gang Pan
- FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts
Shubhankar Singh, Purvi Chaurasia, Yerram Varun, Pranshu Pandya, Vatsal Gupta, Vivek Gupta, Dan Roth
- Unveiling the Achilles’ Heel of NLG Evaluators: A Unified Adversarial Framework Driven by Large Language Models
Yiming Chen, Chen Zhang, Danqing Luo, Luis Fernando D’Haro, Robby T. Tan, Haizhou Li
- Teacher-Student Training for Debiasing: General Permutation Debiasing for Large Language Models
Adian Liusie, Yassir Fathullah, Mark Gales
- Uncovering Limitations of Large Language Models in Information Seeking from Tables
Chaoxu Pang, Yixuan Cao, Chunhao Yang, Ping Luo
- An Ensemble-of-Experts Framework for Rehearsal-free Continual Relation Extraction
Shen Zhou, Yongqi Li, Xin Miao, Tieyun Qian
- Temporal Validity Change Prediction
Georg Wenzel, Adam Jatowt
- RIFF: Learning to Rephrase Inputs for Few-shot Fine-tuning of Language Models
Saeed Najafi, Alona Fyshe
- Modelling Commonsense Commonalities with Multi-Facet Concept Embeddings
Hanane Kteich, Na Li, Usashi Chatterjee, Zied Bouraoui, Steven Schockaert
- Revisiting Multimodal Transformers for Tabular Data with Text Fields
Thomas Bonnier
- ConTempo: A Unified Temporally Contrastive Framework for Temporal Relation Extraction
Jingcheng Niu, Saifei Liao, Victoria Ng, Simon De Montigny, Gerald Penn
- CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems
Abbas Ghaddar, David Alfonso-Hermelo, Philippe Langlais, Mehdi Rezagholizadeh, Boxing Chen, Prasanna Parthasarathi
- CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
Zicheng Lin, Zhibin Gou, Tian Liang, Ruilin Luo, Haowei Liu, Yujiu Yang
- DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language Models
Taolin Zhang, Qizhou Chen, Dongyang Li, Chengyu Wang, Xiaofeng He, Longtao Huang, Hui Xue’, Jun Huang
- Controllable Text Summarization: Unraveling Challenges, Approaches, and Prospects - A Survey
Ashok Urlana, Pruthwik Mishra, Tathagato Roy, Rahul Mishra
- Benchmarking Large Language Models on Communicative Medical Coaching: A Dataset and a Novel System
Hengguan Huang, Songtao Wang, Hongfu Liu, Hao Wang, Ye Wang
- Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation
Ruomeng Ding, Chaoyun Zhang, Lu Wang, Yong Xu, Minghua Ma, Wei Zhang, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang
- Data Augmentation using LLMs: Data Perspectives, Learning Paradigms and Challenges
Bosheng Ding, Chengwei Qin, Ruochen Zhao, Tianze Luo, Xinze Li, Guizhen Chen, Wenhan Xia, Junjie Hu, Anh Tuan Luu, Shafiq Joty
- CeeBERT: Cross-Domain Inference in Early Exit BERT
Divya Jyoti Bajpai, Manjesh Kumar Hanawal
- UNIWIZ: A Unified Large Language Model Orchestrated Wizard for Safe Knowledge Grounded Conversations
Souvik Das, Rohini Srihari
- VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models
Haoyi Qiu, Wenbo Hu, Zi-Yi Dou, Nanyun Peng
- Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding
Xuxin Cheng, Zhihong Zhu, Bang Yang, Xianwei Zhuang, Hongxiang Li, Yuexian Zou
- Towards Safer Large Language Models through Machine Unlearning
Zheyuan Liu, Guangyao Dou, Zhaoxuan Tan, Yijun Tian, Meng Jiang
- The Impact of Reasoning Step Length on Large Language Models
Mingyu Jin, Qinkai Yu, Dong Shu, Haiyan Zhao, Wenyue Hua, Yanda Meng, Yongfeng Zhang, Mengnan Du
- Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness
Guangliang Liu, Milad Afshari, Xitong Zhang, Zhiyu Xue, Avrajit Ghosh, Bidhan Bashyal, Rongrong Wang, Kristen Johnson
- SKGSum: Structured Knowledge-Guided Document Summarization
Qiqi Wang, Ruofan Wang, Kaiqi Zhao, Robert Amor, Benjamin Liu, Jiamou Liu, Xianda Zheng, Zijian Huang
- Chinese Spoken Named Entity Recognition in Real-world Scenarios: Dataset and Approaches
Shilin Zhou, Zhenghua Li, Chen Gong, Lei Zhang, Yu Hong, Min Zhang
- Can Large Multimodal Models Uncover Deep Semantics Behind Images?
Yixin Yang, Zheng Li, Qingxiu Dong, Heming Xia, Zhifang Sui
- Harvesting Events from Multiple Sources: Towards a Cross-Document Event Extraction Paradigm
Qiang Gao, Zixiang Meng, Bobo Li, Jun Zhou, Fei Li, Chong Teng, Donghong Ji
- A Graph per Persona: Reasoning about Subjective Natural Language Descriptions
EunJeong Hwang, Vered Shwartz, Dan Gutfreund, Veronika Thost
- MolTC: Towards Molecular Relational Modeling In Language Models
Junfeng Fang, Shuai Zhang, Chang Wu, Zhengyi Yang, Zhiyuan Liu, Sihang Li, Kun Wang, Wenjie Du, Xiang Wang
- KPEval: Towards Fine-Grained Semantic-Based Keyphrase Evaluation
Di Wu, Da Yin, Kai-Wei Chang
- Learning Low-dimensional Multi-domain Knowledge Graph Embedding via Dual Archimedean Spirals
Jiang Li, Xiangdong Su, Fujun Zhang, Guanglai Gao
- LoRA Meets Dropout under a Unified Framework
Sheng Wang, Liheng Chen, Jiyue Jiang, Boyang XUE, Lingpeng Kong, Chuan Wu
- Enhancing Text-to-SQL Parsing through Question Rewriting and Execution-Guided Refinement
Wenxin Mao, Ruiqi Wang, Jiyu Guo, Jichuan Zeng, Cuiyun Gao, Peiyi Han, Chuanyi Liu
- The Knowledge Alignment Problem: Bridging Human and External Knowledge for Large Language Models
Shuo Zhang, Liangming Pan, Junzhou Zhao, William Yang Wang
- ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models
Haoran Luo, Haihong E, Zichen Tang, Shiyao Peng, Yikai Guo, Wentai Zhang, Chenghao Ma, Guanting Dong, Meina Song, Wei Lin, Yifan Zhu, Anh Tuan Luu
- Achilles-Bench: A Challenging Benchmark for Low-Resource Evaluation
Yudong Wang, Chang Ma, Qingxiu Dong, Zhifang Sui, Lingpeng Kong, Jingjing Xu
- INTERVENOR: Prompting the Coding Ability of Large Language Models with the Interactive Chain of Repair
Hanbin Wang, Zhenghao Liu, Shuo Wang, Ganqu Cui, Ning Ding, Zhiyuan Liu, Ge Yu
- Context-Aware Tracking and Dynamic Introduction for Incomplete Utterance Rewriting in Extended Multi-Turn Dialogues
Xinnan Guo, Qian Zhu, Qiuhui Shi, Xuan Lin, Liubin Wang, DaqianLi, Yongrui Chen
- EmotionQueen: A Benchmark for Evaluating Empathy of Large Language Models
Yuyan Chen, Songzhou Yan, Sijia Liu, Yueze Li, Yanghua Xiao
- Plum: Prompt Learning using Metaheuristics
Rui Pan, Shuo Xing, Shizhe Diao, Wenhe Sun, Xiang Liu, KaShun SHUM, Jipeng Zhang, Renjie Pi, Tong Zhang
- HOTVCOM: Generating Buzzworthy Comments for Videos
Yuyan Chen, Songzhou Yan, Qingpei Guo, Jiyuan Jia, Zhixu Li, Yanghua Xiao
- Do Large Language Models have Problem-Solving Capability under Incomplete Information Scenarios?
Yuyan Chen, Yueze Li, Songzhou Yan, Sijia Liu, Jiaqing Liang, Yanghua Xiao
- Distilling Robustness into Natural Language Inference Models with Domain-Targeted Augmentation
Joe Stacey, Marek Rei
- Into the Unknown: Generating Geospatial Descriptions for New Environments
Tzuf Paz-Argaman, John Palowitch, SAYALI KULKARNI, Reut Tsarfaty, Jason Michael Baldridge
- Unpacking Tokenization: Evaluating Text Compression and its Correlation with Model Performance
Omer Goldman, Avi Caciularu, Matan Eyal, Kris Cao, Idan Szpektor, Reut Tsarfaty
- Length-aware Byte Pair Encoding for Mitigating Over-segmentation in Korean Machine Translation
Jungseob Lee, Hyeonseok Moon, Seungjun Lee, Chanjun Park, Sugyeong Eo, Hyunwoong Ko, Jaehyung Seo, Seungyoon Lee, Heuiseok Lim
- Multilingual Instruction Tuning With Just a Pinch of Multilinguality
Uri Shaham, Jonathan Herzig, Roee Aharoni, Idan Szpektor, Reut Tsarfaty, Matan Eyal
- M3-Embedding: Multi-Linguality, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation
Jianlv Chen, Shitao Xiao, Peitian Zhang, Kun Luo, Defu Lian, Zheng Liu
- Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback
Zhangqian Bi, Yao Wan, Zheng Wang, Hongyu Zhang, Batu Guan, Fangxin Lu, Zili Zhang, Yulei Sui, Hai Jin, Xuanhua Shi
- An Element is Worth a Thousand Words: Enhancing Legal Case Retrieval by Incorporating Legal Elements
Chenlong Deng, Zhicheng Dou, Yujia Zhou, Peitian Zhang, Kelong Mao
- SoMeLVLM: A Large Vision Language Model for Social Media Processing
Xinnong Zhang, Haoyu Kuang, Xinyi Mou, Hanjia Lyu, Kun Wu, Siming Chen, Jiebo Luo, Xuanjing Huang, zhongyu wei
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
Jaehyung Seo, Jaewook Lee, Chanjun Park, SeongTae Hong, Seungjun Lee, Heuiseok Lim
- NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language Models
Amit Dhurandhar, Tejaswini Pedapati, Ronny Luss, Soham Dan, Aurelie Lozano, Payel Das, Georgios Kollias
- Ranking Large Language Models without Ground Truth
Amit Dhurandhar, Rahul Nair, Moninder Singh, Elizabeth M. Daly, Karthikeyan Natesan Ramamurthy
- Integrating Physician Diagnostic Logic into Large Language Models: Preference Learning from Process Feedback
Chengfeng Dou, ying zhang, Zhi Jin, Wenpin Jiao, Haiyan Zhao, Yongqiang Zhao, Zhengwei Tao
- LM-Cocktail: Resilient Tuning of Language Models via Model Merging
Shitao Xiao, Zheng Liu, Peitian Zhang, Xingrun Xing
- Episodic Memory Retrieval from LLMs: A Neuromorphic Mechanism to Generate Commonsense Counterfactuals for Relation Extraction
Xin Miao, Yongqi Li, Shen Zhou, Tieyun Qian
- SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 Languages
Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Abinew Ali Ayele, pavan baswani, Meriem Beloucif, Chris Biemann, Sofia Bourhim, Christine de Kock, Genet Shanko Dekebo, Oumaima Hourrane, Gopichand Kanumolu, Lokesh Madasu, Samuel Rutunda, Manish Shrivastava, Thamar Solorio, Nirmal Surange, Hailegnaw Getaneh Tilaye, Krishnapriya Vishnubhotla, Genta Indra Winata, Seid Muhie Yimam, Saif M. Mohammad
- Alirector: Alignment-Enhanced Chinese Grammatical Error Corrector
Haihui Yang, Xiaojun Quan
- The Emotion Dynamics of Literary Novels
Krishnapriya Vishnubhotla, Adam Hammond, Graeme Hirst, Saif M. Mohammad
- LANS: A Layout-Aware Neural Solver for Plane Geometry Problem
Zhong-Zhi Li, Ming-Liang Zhang, Fei Yin, Cheng-Lin Liu
- Knowledge Crosswords: Geometric Knowledge Reasoning with Large Language Models
Wenxuan Ding, Shangbin Feng, Yuhan Liu, Zhaoxuan Tan, Vidhisha Balachandran, Tianxing He, Yulia Tsvetkov
- DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection
Herun Wan, Shangbin Feng, Zhaoxuan Tan, Heng Wang, Yulia Tsvetkov, Minnan Luo
- The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts
Lingfeng Shen, Weiting Tan, Sihao Chen, Yunmo Chen, Jingyu Zhang, Haoran Xu, Boyuan Zheng, Philipp Koehn, Daniel Khashabi
- Self-Specialization: Uncovering Latent Expertise within Large Language Models
Junmo Kang, Hongyin Luo, Yada Zhu, Jacob A Hansen, James R. Glass, David Daniel Cox, Alan Ritter, Rogerio Feris, Leonid Karlinsky
- FUSE: Measure-Theoretic Compact Fuzzy Set Representation for Taxonomy Expansion
Fred Xu, Song Jiang, Zijie Huang, Xiao Luo, Shichang Zhang, Yuanzhou Chen, Yizhou Sun
- Chain of Logic: Rule-Based Reasoning with Large Language Models
Sergio Servantez, Joe Barrow, Kristian J Hammond, Rajiv Jain
- Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations
Cheng-Han Chiang, Hung-yi Lee
- Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment
William Merrill, Zhaofeng Wu, Norihito Naka, Yoon Kim, Tal Linzen
- Simulated Misinformation Susceptibility (SMISTS): Enhancing Misinformation Research with Large Language Model Simulations
Weicheng Ma, Chunyuan Deng, Aram Moossavi, Lili Wang, Soroush Vosoughi, Diyi Yang
- Social Intelligence Data Infrastructure: Structuring the Present and Navigating the Future
Minzhi Li, Weiyan Shi, Caleb Ziems, Diyi Yang
- MODABS: Multi-Objective Learning for Dynamic Aspect-Based Summarization
Xiaobo Guo, Soroush Vosoughi
- Non-compositional Expression Generation and its Continual Learning
Jianing Zhou, Suma Bhat
- Medical Dialogue System: A Survey of Categories, Methods, Evaluation and Challenges
Xiaoming Shi, Zeming Liu, Li Du, Yuxuan Wang, Hongru WANG, Yuhang Guo, Tong Ruan, JIE XU, Xiaofan Zhang, Shaoting Zhang
- Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs
Thi Minh Vuong Nguyen, LINHAO LUO, Fatemeh Shiri, Dinh Phung, Yuan-Fang Li, Trang Vu, Gholamreza Haffari
- Comprehensive Abstractive Comment Summarization with Dynamic Clustering and Chain of Thought
Longyin Zhang, Bowei Zou, Jacintha Wee Yun Yi, AiTi Aw
- Self-Supervised Position Debiasing for Large Language Models
Zhongkun Liu, Zheng Chen, Mengqi Zhang, Zhaochun Ren, Pengjie Ren, Zhumin Chen
- HyperCL: A Contrastive Learning Framework for Hyper-Relational Knowledge Graph Embedding with Hierarchical Ontology
Yuhuan Lu, Weijian Yu, Xin Jing, Dingqi Yang
- Encoding Hierarchical Schema via Concept Flow for Multifaceted Ideology Detection
Songtao Liu, Bang Wang, Wei Xiang, Han Xu, Minghua Xu
- Character-Level Chinese Dependency Parsing via Modeling Latent Intra-Word Structure
Yang Hou, Zhenghua Li
- AlignRE: An Encoding and Semantic Alignment Approach for Zero-Shot Relation Extraction
Zehan Li, Fu Zhang, Jingwei Cheng
- Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction
Tingchen Fu, Deng Cai, Lemao Liu, Shuming Shi, Rui Yan
- Efficient Knowledge Infusion via KG-LLM Alignment
Zhouyu Jiang, Ling Zhong, Mengshu Sun, Jun Xu, Rui Sun, Hui Cai, SHUHAN LUO, Zhiqiang Zhang
- Towards Precise Localization of Critical Errors in Machine Translation
Dahyun Jung, Sugyeong Eo, Heuiseok Lim
- LoRAPrune: Structured Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning
Mingyang Zhang, Hao Chen, Chunhua Shen, Zhen Yang, Linlin Ou, Xinyi Yu, Bohan Zhuang
- Speculative Decoding via Early-exiting for Faster LLM Inference with Thompson Sampling Control Mechanism
Jiahao Liu, Qifan Wang, Jingang Wang, Xunliang Cai
- AgentTuning: Enabling Generalized Agent Abilities for LLMs
Aohan Zeng, Mingdao Liu, Rui Lu, Bowen Wang, Xiao Liu, Yuxiao Dong, Jie Tang
- Transition-based Opinion Generation for Aspect-based Sentiment Analysis
Tianlai Ma, Zhongqing Wang, Guodong Zhou
- Modeling Dynamic Topics in Chain-Free Fashion by Evolution-Tracking Contrastive Learning and Unassociated Word Exclusion
Xiaobao Wu, Xinshuai Dong, Liangming Pan, Thong Thanh Nguyen, Anh Tuan Luu
- A Chinese Dataset for Evaluating the Safeguards in Large Language Models
Yuxia Wang, Zenan Zhai, Haonan Li, Xudong Han, Shom Lin, Zhenxuan ZHANG, Angela Jingru Zhao, Preslav Nakov, Timothy Baldwin
- LLMFactor: Extracting Profitable Factors through Prompts for Explainable Stock Movement Prediction
Meiyun Wang, Kiyoshi Izumi, Hiroki Sakaji
- You Only Look at Screens: Multimodal Chain-of-Action Agents
Zhuosheng Zhang, Aston Zhang
- $\rm SP^3$: Enhancing Structured Pruning via PCA Projection
Yuxuan Hu, Jing Zhang, Zhe Zhao, Chen Zhao, Xiaodong Chen, Cuiping Li, Hong Chen
- GENDEX: Generative Data Augmentation Strategy Leveraging External Data for Abstractive Dialogue Summarization
Sangwon Park, Hongseok Choi, Dongha Choi, Hyunju Lee
- A Tale of Two Revisions: Summarizing Changes Across Document Versions
Santosh T.Y.S.S, Natwar Modani, Apoorv Saxena
- Refine, Align, and Aggregate: Multi-view Linguistic Features Enhancement for Aspect Sentiment Triplet Extraction
Guixin Su, Mingmin Wu, Zhongqiang Huang, Yongcheng Zhang, Tongguan Wang, Yuxue Hu, Ying Sha
- The Music Maestro or The Musically Challenged, A Massive Music Evaluation Benchmark for Large Language Models
Jiajia Li, lu Yang, Mingni Tang, Chenchong, Zuchao Li, Ping Wang, hai zhao
- PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference
Dongjie Yang, Xiaodong Han, Yan Gao, Yao Hu, Shilin Zhang, hai zhao
- From Role-Play to Drama-Interaction: An LLM Solution
Weiqi Wu, Hongqiu Wu, Lai Jiang, Xingyuan Liu, hai zhao, Min Zhang
- TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models
Jaewoo Ahn, Taehyun Lee, Junyoung Lim, Jin-Hwa Kim, Sangdoo Yun, Hwaran Lee, Gunhee Kim
- Red Teaming Visual Language Models
Mukai Li, Lei Li, Yuwei Yin, Masood Ahmed, Zhenguang Liu, Qi Liu
- Enhancing Semantic Consistency of Large Language Models through Model Editing: An Interpretability-Oriented Approach
JINGYUAN YANG, Dapeng Chen, Yajing Sun, Rongjun Li, Zhiyong Feng, Wei Peng
- Semantic Skill Grounding for Embodied Instruction-Following in Cross-Domain Environments
Sangwoo Shin, SeungHyun Kim, Youngsoo Jang, Moontae Lee, Honguk Woo
- LIRE: listwise reward enhancement for preference alignment
Mingye Zhu, Yi Liu, Lei Zhang, Junbo Guo, Zhendong Mao
- See It All: Contextualized Late Aggregation for 3D Dense Captioning
Minjung Kim, Hyung Suk Lim, Seung Hwan Kim, Soonyoung Lee, Bumsoo Kim, Gunhee Kim
- $\texttt{DARA}$: Decomposition-Alignment-Reasoning Autonomous Language Agent for Question Answering over Knowledge Graphs
Haishuo Fang, Xiaodan Zhu, Iryna Gurevych
- GKT: A Novel Guidance-Based Knowledge Transfer Framework For Efficient Cloud-edge Collaboration LLM Deployment
Yao Yao, Zuchao Li, hai zhao
- Compositional Generalization with Grounded Language Models
Sondre Wold, Étienne Simon, Lucas Georges Gabriel Charpentier, Egor V. Kostylev, Erik Velldal, Lilja Øvrelid
- Rethinking Negative Instances for Generative Named Entity Recognition
Yuyang Ding, Juntao Li, Pinzheng Wang, Zecheng Tang, Yan Bowen, Min Zhang
- WilKE: Wise-Layer Knowledge Editor for Lifelong Knowledge Editing
Chenhui Hu, Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao
- DINER: Debiasing Aspect-based Sentiment Analysis with Multi-variable Causal Inference
Jialong Wu, Linhai Zhang, Deyu Zhou, Guoqiang Xu
- STAR: Constraint LoRA with Dynamic Active Learning for Data-Efficient Fine-Tuning of Large Language Models
Linhai Zhang, Jialong Wu, Deyu Zhou, Guoqiang Xu
- How Much Does Nonverbal Communication Conform to Entropy Rate Constancy?: A Case Study on Listener Gaze in Interaction
Yu Wang, Yang Xu, Gabriel Skantze, Hendrik Buschmeier
- Lost in the Source Language: How Large Language Models Evaluate the Quality of Machine Translation
Xu Huang, Zhirui Zhang, Xiang Geng, Yichao Du, Jiajun Chen, Shujian Huang
- Chain-of-Verification Reduces Hallucination in Large Language Models
Shehzaad Dhuliawala, Mojtaba Komeili, Jing Xu, Roberta Raileanu, Xian Li, Asli Celikyilmaz, Jason E Weston
- Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method
Tian Xia, Zhiwei He, Tong Ren, Yibo Miao, Zhuosheng Zhang, Yang Yang, Rui Wang
- DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories
Jia Li, Ge Li, Yunfei Zhao, Yongmin Li, Huanyu Liu, Hao Zhu, Lecheng Wang, Kaibo Liu, Zheng Fang, Lanshen Wang, jiazheng ding, Xuanming Zhang, YUQI ZHU, Yihong Dong, Zhi Jin, Binhua Li, Fei Huang, Yongbin Li, Bin Gu, Mengfei Yang
- LPNL: Scalable Link Prediction with Large Language Models
Baolong Bi, Shenghua Liu, Yiwei Wang, Lingrui Mei, Xueqi Cheng
- Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives
Thong Thanh Nguyen, Yi Bin, Junbin Xiao, Leigang Qu, Yicong Li, Jay Zhangjie Wu, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu
- Generative Input: Towards Next-Generation Input Methods Paradigm
Keyu Ding, Yongcan Wang, Zihang Xu, Zhenzhen Jia, Enhong Chen
- A + B: A General Generator-Reader Framework for Optimizing LLMs to Unleash Synergy Potential
Wei Tang, Yixin Cao, Jiahao Ying, Bo Wang, Yuyue Zhao, Yong Liao, Pengyuan Zhou
- Functional Overlap Reranking for Neural Code Generation
Hung Quoc To, Minh Huynh Nguyen, Nghi D. Q. Bui
- Adversarial Preference Optimization: Enhancing Your Alignment via RM-LLM Game
Pengyu Cheng, Yifan Yang, Jian Li, Yong Dai, Tianhao Hu, peixin cao, nan du, Xiaolong Li
- Pinpointing Diffusion Grid Noise to Enhance Aspect Sentiment Quad Prediction
Linan ZHU, Xiangfan Chen, Xiaolei Guo, Chenwei Zhang, Zhechao Zhu, Zehai Zhou, Xiangjie Kong
- Continual Contrastive Spoken Language Understanding
Umberto Cappellazzo, Enrico Fini, Muqiao Yang, Daniele Falavigna, Alessio Brutti, Bhiksha Raj
- LLM as Prompter: Low-resource Inductive Reasoning on Arbitrary Knowledge Graphs
Kai Wang, YUWEI XU, Zhiyong Wu, Siqiang Luo
- Unsupervised Parsing by Searching for Frequent Word Sequences among Sentences with Equivalent Predicate-Argument Structures
Junjie Chen, Xiangheng He, Danushka Bollegala, Yusuke Miyao
- Data-Centric Explainable Debiasing for Improving Fairness in Pre-trained Language Models
Yingji Li, Mengnan Du, Rui Song, Xin Wang, Ying Wang
- Knowledge-Driven Cross-Document Relation Extraction
Monika Jain, Raghava Mutharaju, Kuldeep Singh, Ramakanth Kavuluru
- Injecting Salesperson’s Dialogue Strategies in Large Language Models with Chain-of-Thought Reasoning
Wen Yu Chang, Yun-Nung Chen
- KG-Adapter: Enabling Knowledge Graph Integration in Large Language Models through Parameter-Efficient Fine-Tuning
Shiyu Tian, Yangyang Luo, Tianze Xu, Caixia Yuan, Huixing Jiang, Chen Wei, Xiaojie Wang
- Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios
Lei Lin, Jiayi Fu, Pengli Liu, Qingyang Li, Yan Gong, Junchen Wan, Fuzheng Zhang, Zhongyuan Wang, Di ZHANG, Kun Gai
- Evaluating LLMs’ Mathematical Reasoning in Financial Document Question Answering
Pragya Srivastava, Manuj Malik, Vivek Gupta, Tanuja Ganu, Dan Roth
- Can Large Language Models Mine Interpretable Financial Factors More Effectively? A Neural-Symbolic Factor Mining Agent Model
Zhiwei Li, Ran Song, Caihong Sun, Wei Xu, Zhengtao Yu, Ji-Rong Wen
- Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint
Xiaowei Yuan, Zhao Yang, Yequan Wang, Shengping Liu, Jun Zhao, Kang Liu
- SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models
Lijun Li, Bowen Dong, Ruohui Wang, Xuhao Hu, Wangmeng Zuo, Dahua Lin, Yu Qiao, Jing Shao
- Extracting and Encoding: Leveraging Large Language Models and Medical Knowledge to Enhance Radiological Text Representation
Pablo Messina, Rene Vidal, Denis Parra, Alvaro Soto, Vladimir Araujo
- GNNavi: Navigating the Information Flow in Large Language Models by Graph Neural Network
Shuzhou Yuan, Ercong Nie, Michael Färber, Helmut Schmid, Hinrich Schuetze
- M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question Answering
Anand Subramanian, Viktor Schlegel, Abhinav Ramesh Kashyap, Thanh-Tung Nguyen, Vijay Prakash Dwivedi, Stefan Winkler
- Autonomous Workflow for Multimodal Fine-Grained Training Assistants Towards Mixed Reality
Jiahuan Pei, Irene Viola, Haochen Huang, Junxiao Wang, Moonisa Ahsan, Fanghua Ye, Jiang Yiming, Yao Sai, Di Wang, Zhumin Chen, Pengjie Ren, Pablo Cesar
- Perceptions of Language Technology Failures from South Asian English Speakers
Faye Holt, William Barr Held, Diyi Yang
- A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task
Jannik Brinkmann, Abhay Sheshadri, Victor Levoso, Paul Swoboda, Christian Bartelt
- Optimal Transport Guided Correlation Assignment for Multimodal Entity Linking
Zefeng Zhang, Jiawei Sheng, ZHANG CHUANG, liangyunzhi, Wenyuan Zhang, Siqi Wang, Tingwen Liu
- On Efficiently Representing Regular Languages as RNNs
Anej Svete, Robin Chan, Ryan Cotterell
- A Survey on Modelling Morality for Text Analysis
Ines Reinig, Maria Becker, Ines Rehbein, Simone Paolo Ponzetto
- Your Vision-Language Model Itself Is a Strong Filter: Towards High-Quality Instruction Tuning with Data Selection
Ruibo Chen, Yihan Wu, Lichang Chen, Guodong Liu, Qi He, Tianyi Xiong, Chenxi Liu, Junfeng Guo, Heng Huang
- DebugBench: Evaluating Debugging Capability of Large Language Models
Runchu Tian, Yining Ye, Yujia Qin, Xin Cong, Yankai Lin, Yinxu Pan, Yesai Wu, Hui Haotian, Liu Weichuan, Zhiyuan Liu, Maosong Sun
- POP-CEE: Position-oriented Prompt-tuning Model for Causal Emotion Entailment
Zhihan Zhou, Xue Gu, Yujie Zhao, Hao Xu
- Wav2SQL: Direct Generalizable Speech-To-SQL Parsing
Huadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao
- E2-LLM: Efficient and Extreme Length Extension of Large Language Models
Jiaheng Liu, ZhiqiBai, Yuanxing Zhang, Zhang Chenchen, YuangZh, Ge Zhang, JiakaiWang, Haoran Que, Yukang Chen, Wenbo Su, Tiezheng Ge, Jie Fu, Wenhu Chen, Bo Zheng
- Are Female Carpenters like Blue Bananas? A Corpus Investigation of Occupation Gender Typicality
Da JU, Karen Ullrich, Adina Williams
- Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments
Sitao Cheng, Ziyuan Zhuang, Yong Xu, Fangkai Yang, Chaoyun Zhang, Xiaoting Qin, Xiang Huang, Ling Chen, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan, Qi Zhang
- Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI Interpretation in Indian Courts
Shubham Kumar Nigam, Anurag Sharma, Danush Khanna, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya
- RulE: Knowledge Graph Reasoning with Rule Embedding
Xiaojuan Tang, Song-Chun Zhu, Yitao Liang, Muhan Zhang
- Multi-Objective Linguistic Control of Large Language Models
Dang Nguyen, Jiuhai Chen, Tianyi Zhou
- Evaluating the Smooth Control of Attribute Intensity in Text Generation with LLMs
Shang Zhou, Feng Yao, Chengyu Dong, Zihan Wang, Jingbo Shang
- Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios
Shijue Huang, Wanjun Zhong, Jianqiao Lu, Qi Zhu, Jiahui Gao, Weiwen Liu, Yutai Hou, Xingshan Zeng, Yasheng Wang, Lifeng Shang, Xin Jiang, Ruifeng Xu, Qun Liu
- Do Androids Know They’re Only Dreaming of Electric Sheep?
Sky CH-Wang, Benjamin Van Durme, Jason Eisner, Chris Kedzie
- URG: A Unified Ranking and Generation Method for Ensembling Language Models
Bo Lv, Chen Tang, Yanan Zhang, Xin Liu, Ping Luo, Yue Yu
- Multi-Modal Retrieval For Large Language Model Based Speech Recognition
Aditya Gourav, Jari Kolehmainen, Prashanth Gurunath Shivakumar, Yile Gu, Grant Strimel, Ankur Gandhe, Ariya Rastrow, Ivan Bulyko
- LoraRetriever: Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild
Ziyu Zhao, Leilei Gan, Guoyin Wang, Wangchunshu Zhou, Hongxia Yang, Kun Kuang, Fei Wu
- ELAD: Explanation-Guided Large Language Models Active Distillation
Yifei Zhang, Bo Pan, Chen Ling, Yuntong Hu, Liang Zhao
- Evaluating the Elementary Multilingual Capabilities of Large Language Models with MultiQ
Carolin Holtermann, Paul Röttger, Timm Dill, Anne Lauscher
- The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)
Shenglai Zeng, Jiankun Zhang, Pengfei He, Yiding Liu, Yue Xing, Han Xu, Jie Ren, Yi Chang, Shuaiqiang Wang, Dawei Yin, Jiliang Tang
- EmpathicStories++: A Multimodal Dataset for Empathy Towards Personal Experiences
Jocelyn J Shen, Yubin Kim, Mohit Hulse, Wazeer Zulfikar, Sharifa Alghowinem, Cynthia Breazeal, Hae Won Park
- MRL Parsing Without Tears: The Case of Hebrew
Shaltiel Shmidman, Avi Shmidman, Moshe Koppel, Reut Tsarfaty
- SyntaxShap: Syntax-aware Explainability Method for Text Generation
Kenza Amara, Rita Sevastjanova, Mennatallah El-Assady
- Enhancing Hyperbolic Knowledge Graph Embeddings via Lorentz Transformations
Xiran Fan, Minghua Xu, Huiyuan Chen, Yuzhong Chen, Mahashweta Das, Hao Yang
- Tell Me What’s Next: Textual Foresight for Generic UI Representations
Andrea Burns, Kate Saenko, Bryan A. Plummer
- Probing the Uniquely Identifiable Linguistic Patterns of Conversational AI Agents
Iqra Zahid, Tharindu Madusanka, Riza Batista-Navarro, Youcheng Sun
- The Butterfly Effect of Altering Prompts: How Small Changes and Jailbreaks Affect Large Language Model Performance
Abel Salinas, Fred Morstatter
- X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification
Hanzi Xu, Muhao Chen, Lifu Huang, Slobodan Vucetic, Wenpeng Yin
- SPIN: Sparsifying and Integrating Internal Neurons in Large Language Models for Text Classification
Difan Jiao, Yilun Liu, Zhenwei Tang, Daniel Matter, Jürgen Pfeffer, Ashton Anderson
- Decomposing Co-occurrence Matrices into Interpretable Components as Formal Concepts
Akihiro Maeda, Takuma Torii, Shohei Hidaka
- Two-Pronged Human Evaluation of ChatGPT Self-Correction in Radiology Report Simplification
Ziyu Yang, Santhosh Cherian, Slobodan Vucetic
- Planning First, Question Second: An LLM-Guided Method for Controllable Question Generation
Kunze Li, Yu Zhang
- RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback
Yanming Liu, Xinyue Peng, Xuhong Zhang, Weihao Liu, Jianwei Yin, Jiannan Cao, Tianyu Du
- MrRank: Improving Question Answering Retrieval System through Multi-Result Ranking Model
Danupat Khamnuansin, Tawunrat Chalothorn, Ekapol Chuangsuwanich
- Chain-of-Question: A Progressive Question Decomposition Approach for Complex Knowledge Base Question Answering
Peng Yixing, Quan Wang, Licheng Zhang, Yi Liu, Zhendong Mao
- Instruction Tuning with Retrieval-based Examples Ranking for Aspect-based Sentiment Analysis
Guangmin Zheng, Jin Wang, Liang-Chih Yu, Xuejie Zhang
- Unveiling the Truth and Facilitating Change: Towards Agent-based Large-scale Social Movement Simulation
Xinyi Mou, zhongyu wei, Xuanjing Huang
- Locating and Extracting Relational Concepts in Large Language Models
Zijian Wang, Britney Whyte, Chang Xu
- Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language Models
Mingda Li, Xinyu Li, Yifan Chen, Wenfeng Xuan, Weinan Zhang
- SenticVec: Toward Robust and Human-Centric Neurosymbolic Sentiment Analysis
Xulang Zhang, Rui Mao, Erik Cambria
- Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models
Chen Qian, Jie Zhang, Wei Yao, Dongrui Liu, Zhenfei Yin, Yu Qiao, Yong Liu, Jing Shao
- Language Models can Evaluate Themselves via Probability Discrepancy
Tingyu Xia, Bowen Yu, Yuan Wu, Yi Chang, Chang Zhou
- Evaluating the Validity of Word-level Adversarial Attacks with Large Language Models
Huichi Zhou, Zhaoyang Wang, Hongtao Wang, Dongping Chen, Wenhan Mu, Fangyuan Zhang
- On the Language Encoder of Contrastive Cross-modal Models
Mengjie Zhao, Junya Ono, Zhi Zhong, Chieh-Hsin Lai, Yuhta Takida, Naoki Murata, Wei-Hsiang Liao, Takashi Shibuya, Hiromi Wakaki, Yuki Mitsufuji
- Your Co-Workers Matter: Evaluating Collaborative Capabilities of Language Models in Blocks World
Guande Wu, Chen Zhao, Claudio Silva, He He
- Anchor-based Large Language Models
Jianhui Pang, Fanghua Ye, Derek F. Wong, Xin He, Wanshun CHEN, Longyue Wang
- MLeVLM: Improve Multi-level Progressive Capabilities based on Multimodal Large Language Model for Medical Visual Question Answering
Dexuan Xu, Yanyuan Chen, Jieyi Wang, Yue Huang, Hanpin Wang, Zhi Jin, Hongxing Wang, Weihua Yue, Jing He, Hang Li, Yu Huang
- Disentangling Length from Quality in Direct Preference Optimization
Ryan Park, Rafael Rafailov, Stefano Ermon, Chelsea Finn
- MIKE: A New Benchmark for Fine-grained Multimodal Entity Knowledge Editing
Jiaqi Li, Miaozeng Du, Chuanyi Zhang, Yongrui Chen, Nan Hu, Guilin Qi, Haiyun Jiang, Siyuan Cheng, Bozhong Tian
- Reformulating Domain Adaptation of Large Language Models as Adapt-Retrieve-Revise: A Case Study on Chinese Legal Domain
Zhen Wan, Yating Zhang, Yexiang Wang, Fei Cheng, Sadao Kurohashi
- MemeMQA: Multimodal Question Answering for Memes via Rationale-Based Inferencing
Siddhant Agarwal, Shivam Sharma, Preslav Nakov, Tanmoy Chakraborty
- Improving Attributed Text Generation of Large Language Models via Preference Learning
Dongfang Li, Zetian Sun, Baotian Hu, zhenyu liu, Xinshuo Hu, Xuebo Liu, Min Zhang
- KOMBO: Korean Character Representations Based on the Combination Rules of Subcharacters
SungHo Kim, Juhyeong Park, Yeachan Kim, SangKeun Lee
- Tree-Planted Transformers: Unidirectional Transformer Language Models with Implicit Syntactic Supervision
Ryo Yoshida, Taiga Someya, Yohei Oseki
- Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit Clues
Zhiyuan Chang, Mingyang Li, Yi Liu, Junjie Wang, Qing Wang, Yang Liu
- Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes
Sunjun Kweon, Junu Kim, Jiyoun Kim, Sujeong Im, Eunbyeol Cho, Seongsu Bae, Jungwoo Oh, Gyubok Lee, Jong Hak Moon, Seng Chan You, Seungjin Baek, Chang Hoon Han, YOON BIN JUNG, Yohan Jo, Edward Choi
- Extending Context Window of Large Language Models via Semantic Compression
Weizhi Fei, Xueyan Niu, Pingyi Zhou, Lu Hou, Bo Bai, Lei Deng, Wei Han
- Plausible Extractive Rationalization through Semi-Supervised Entailment Signal
Yeo Wei Jie, Ranjan Satapathy, Erik Cambria
- Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering
ChaeHun Park, Koanho Lee, Hyesu Lim, Jaeseok Kim, Junmo Park, Yu-Jung Heo, Du-Seong Chang, Jaegul Choo
- Scented-EAE: Stage-Customized Entity Type Embedding for Event Argument Extraction
Yu Yang, Jinyu Guo, Kai Shuang, Chenrui Mao
- Fast Randomized Low-Rank Adaptation of Pre-trained Language Models with PAC Regularization
Zijian Lei, Dong Qian, William K. Cheung
- SDA: Semantic Discrepancy Alignment for Text-conditioned Image Retrieval
Yuchen Yang, Yu Wang, Yanfeng Wang
- $Se^2$: Sequential Example Selection for In-Context Learning
Haoyu Liu, Jianfeng Liu, Shaohan Huang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Furu Wei, Qi Zhang
- Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct Decoding
Hanling Yi, Feng Lin, Hongbin Li, Peiyang Ning, Xiaotian Yu, Rong Xiao
- StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation
Boxi Cao, Mengjie Ren, Hongyu Lin, Xianpei Han, Feng Zhang, Junfeng Zhan, Le Sun
- Mitigating Privacy Seesaw in Large Language Models: Augmented Privacy Neuron Editing via Activation Patching
Xinwei Wu, Weilong Dong, Shaoyang Xu, Deyi Xiong
- BadActs: A Universal Backdoor Defense in the Activation Space
Biao Yi, Sishuo Chen, Yiming Li, Tong Li, Baolei Zhang, Zheli Liu
- ReactXT: Understanding Molecular “Reaction-ship” via Reaction-Contextualized Molecule-Text Pretraining
Zhiyuan Liu, Yaorui Shi, An Zhang, Sihang Li, Enzhi Zhang, Xiang Wang, Kenji Kawaguchi, Tat-Seng Chua
- Multi-modal Concept Alignment Pre-training for Generative Medical Visual Question Answering
Quan Yan, Junwen Duan, Jianxin Wang
- Exploring Ordinality in Text Classification: A Comparative Study of Explicit and Implicit Techniques
Siva Rajesh Kasa, Aniket Goel, Karan Gupta, Sumegh Roychowdhury, Pattisapu Nikhil Priyatam, Anish bhanushali, Prasanna Srinivasa Murthy
- The Butterfly Effect of Model Editing: Few Edits Can Trigger Large Language Models Collapse
Wanli Yang, Fei Sun, Xinyu Ma, Xun Liu, Dawei Yin, Xueqi Cheng
- Can We Continually Edit Language Models? On the Knowledge Attenuation in Sequential Model Editing
Qi Li, Xiaowen Chu
- Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation
Ge Qu, Jinyang Li, Bowen Li, Bowen Qin, Nan Huo, Chenhao Ma, Reynold Cheng
- Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation
Zhibin Lan, Liqiang Niu, Fandong Meng, Jie Zhou, Min Zhang, Jinsong Su
- StatBot.Swiss: Bilingual Open Data Exploration in Natural Language
Farhad Nooralahzadeh, Yi Zhang, Ellery Smith, Sabine Maennel, Cyril Matthey-Doret, Raphaël De Fondeville, Kurt Stockinger
- Subtle Signatures, Strong Shields: Advancing Robust and Imperceptible Watermarking in Large Language Models
Yubing Ren, Ping Guo, Yanan Cao, Wei Ma
- Thinking about how to extract: Energizing LLMs’ emergence capabilities for document-level event argument extraction
Kai Shuang, zhouji, wang qiwei, Jinyu Guo
“* Improving the Robustness of Distantly-Supervised Named Entity Recognition via Uncertainty-Aware Teacher Learning
and Student-Student Collaborative Learning
Shuzheng Si, Helan Hu, Haozhe Zhao, Shuang Zeng, Kaikai An, Zefan Cai, Baobao Chang”
- SSS: Editing Factual Knowledge in Language Models towards Semantic Sparse Space
Huazheng Wang, Haifeng Sun, Jingyu Wang, Qi Qi, Zixuan Xia, Menghao Zhang, Jianxin Liao
- $\textit{GeoHard}$: Towards Measuring Class-wise Hardness through Modelling Class Semantics
Fengyu Cai, Xinran Zhao, Hongming Zhang, Iryna Gurevych, Heinz Koeppl
- Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models
Sheng-Lun Wei, Cheng-Kuang Wu, Hen-Hsen Huang, Hsin-Hsi Chen
- ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic
Fajri Koto, Haonan Li, Sara Shatnawi, Jad Doughman, Abdelrahman Boda Sadallah, Aisha Alraeesi, Khalid Almubarak, Zaid Alyafeai, Neha Sengupta, Shady Shehata, Nizar Habash, Preslav Nakov, Timothy Baldwin
- On the Relationship Between RNN Hidden-State Vectors and Semantic Structures
Edi Muskardin, Martin Tappler, Ingo Pill, Bernhard K. Aichernig, Thomas Pock
- XMC-Agent : Dynamic Navigation over Scalable Hierarchical Index for Incremental Extreme Multi-label Classification
yanjiang liu, Tianyun Zhong, Yaojie Lu, Hongyu Lin, Ben He, Shuheng Zhou, Huijia Zhu, Weiqiang Wang, Zhongyi Liu, Xianpei Han, Le Sun
- Benchmarking Large Language Models on CFLUE - A Chinese Financial Language Understanding Evaluation Dataset
Jie Zhu, Junhui Li, yalong wen, Lifan Guo
- Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint
Zhipeng Chen, Kun Zhou, Xin Zhao, Junchen Wan, Fuzheng Zhang, Di ZHANG, Ji-Rong Wen
- Definition generation for lexical semantic change detection
Mariia Fedorova, Andrey Kutuzov, Yves Scherrer
- MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector
Marta R. Costa-jussà, Mariano Coria Meglioli, Pierre Andrews, David Dale, Prangthip Hansanti, Elahe Kalbassi, Alexandre Mourachko, Christophe Ropers, Carleigh Wood
- Phased Instruction Fine-Tuning for Large Language Models
Wei Pang, Chuan Zhou, Xiao-Hua Zhou, Xiaojie Wang
- TOREE: Evaluating Topic Relevance of Student Essays for Chinese Primary and Middle School Education
Xinlin Zhuang, Hongyi Wu, Xinshu Shen, Peimin Yu, Gaowei Yi, Xinhao Chen, Tu Hu, Yang Chen, Yupei Ren, Yadong Zhang, Youqi Song, Binxuan Liu, Man Lan
- Predicting the Unpredictable: Uncertainty-Aware Reasoning over Temporal Knowledge Graphs via Diffusion Process
Yuxiang Cai, Qiao Liu, Yanglei Gan, Changlin Li, Xueyi Liu, Run Lin, Da Luo, JiayeYang
- Asymmetric Bias in Text-to-Image Generation with Adversarial Attacks
Haz Sameen Shahgir, Xianghao Kong, Greg Ver Steeg, Yue Dong
- Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs
Xun Liang, Hanyu Wang, Shichao Song, Mengting Hu, Xunzhi Wang, Zhiyu li, Feiyu Xiong, Bo Tang
- Coconut: Contextualized Commonsense Unified Transformers for Graph-Based Commonsense Augmentation of Language Models
Jun-Hyung Park, Mingyu Lee, Junho Kim, SangKeun Lee
- Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge
Daniel Tamayo Mela, Aitor Gonzalez-Agirre, Javier Hernando, Marta Villegas
- BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains
Yanis Labrak, Adrien Bazoge, Emmanuel Morin, Pierre-Antoine GOURRAUD, Mickael Rouvier, Richard Dufour
- All Languages Matter: On the Multilingual Safety of LLMs
Wenxuan Wang, Zhaopeng Tu, Chang Chen, Youliang Yuan, Jen-tse Huang, Wenxiang Jiao, Michael Lyu
- LJPCheck: Functional Tests for Legal Judgment Prediction
Yuan Zhang, Wanhong Huang, Yi Feng, Chuanyi Li, Zhiwei Fei, Jidong Ge, Bin Luo, Vincent Ng
- CMDL: A Large-Scale Chinese Multi-Defendant Legal Judgment Prediction Dataset
Wanhong Huang, Yi Feng, Chuanyi Li, Honghan Wu, Jidong Ge, Vincent Ng
- Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning
Qiming Bao, Alex Yuxuan Peng, Zhenyun Deng, Wanjun Zhong, Gael Gendron, Timothy Pistotti, Neset TAN, Nathan Young, Yang Chen, Yonghua Zhu, Paul Denny, Michael Witbrock, Jiamou Liu
- CodeInsight: A Curated Dataset of Practical Coding Solutions from Stack Overflow
Nathanaël Beau, Benoit Crabbé
- ViHateT5: Enhancing Hate Speech Detection in Vietnamese With a Unified Text-to-Text Transformer Model
Luan Thanh Nguyen
- Bias in News Summarization: Measures, Pitfalls and Corpora
Julius Steen, Katja Markert
- When to Trust LLMs: Aligning Confidence with Response Quality
Shuchang Tao, Liuyi Yao, Hanxing Ding, Yuexiang Xie, Qi Cao, Fei Sun, Jinyang Gao, Huawei Shen, Bolin Ding
- Zero-shot Cross-lingual Alignment for Embedding Initialization
Xi Ai, Zhiyong Huang
- It takes two to borrow: a donor and a recipient. Who’s who?
Liviu P Dinu, Ana Sabina Uban, Anca Daniela Dinu, Ioan-Bogdan Iordache, Simona Georgescu, Laurentiu Zoicas
- Advancing Post-OCR Correction: A Comparative Study of Synthetic Data
Shuhao Guan, Derek Greene
- GeoAgent: To Empower LLMs using Geospatial Tools for Address Standardization
Chenghua Huang, Shisong Chen, Zhixu Li, Jianfeng Qu, Yanghua Xiao, Jiaxin Liu, Zhigang Chen
- HQP: A Human-Annotated Dataset for Detecting Online Propaganda
Abdurahman Maarouf, Dominik Bär, Dominique Geissler, Stefan Feuerriegel
- Teaching Language Models to Self-Improve by Learning from Language Feedback
Chi Hu, Yimin Hu, Hang Cao, Tong Xiao, JingBo Zhu
- Exploring Spatial Schema Intuitions in Large Language and Vision Models
Philipp Wicke, Lennart Wachowiak
- Efficient Detection of LLM-generated Texts with a Bayesian Surrogate Model
Yibo Miao, Hongcheng Gao, Hao Zhang, Zhijie Deng
- Decoding the Narratives: Analyzing Personal Drug Experiences Shared on Reddit
Layla Bouzoubaa, Elham Aghakhani, Max Song, Quang Minh Trinh, Shadi Rezapour
- Unveiling the Art of Heading Design: A Harmonious Blend of Summarization, Neology, and Algorithm
Shaobo Cui, Yiyang Feng, Yisong Mao, Yifan Hou, Boi Faltings
- Understanding Fine-grained Distortions in Reports of Scientific Findings
Amelie Wuehrl, Dustin Wright, Roman Klinger, Isabelle Augenstein
- MM-SOC: Benchmarking Multimodal Large Language Models in Social Media Platforms
Yiqiao Jin, Minje Choi, Gaurav Verma, Jindong Wang, Srijan Kumar
- Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance
Saurabh Srivastava, Chengyue Huang, Weiguo Fan, Ziyu Yao
- Benchmarking Retrieval-Augmented Generation for Medicine
Guangzhi Xiong, Qiao Jin, Zhiyong Lu, Aidong Zhang
- ChatMusician: Understanding and Generating Music Intrinsically with LLM
Ruibin Yuan, Hanfeng Lin, Yi Wang, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Liumeng Xue, Ziyang Ma, Qin Liu, Tianyu Zheng, Yizhi LI, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Chenghua Lin, Qifeng Liu, Tao Jiang, Wenhao Huang, Wenhu Chen, Jie Fu, Emmanouil Benetos, Gus Xia, Roger Dannenberg, Wei Xue, Shiyin Kang, Yike Guo
- Towards Robust Temporal Reasoning of Large Language Models via a Multi-Hop QA Dataset and Pseudo-Instruction Tuning
Qingyu Tan, Hwee Tou Ng, Lidong Bing
- Mind Your Format: Towards Consistent Evaluation of In-Context Learning Improvements
Anton Voronov, Lena Wolf, Max Ryabinin
- Knowledge Graph-Enhanced Large Language Models via Path Selection
Haochen Liu, Song Wang, Yaochen Zhu, Yushun Dong, Jundong Li
- OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection
Chenyang Huang, Abbas Ghaddar, Ivan Kobyzev, Mehdi Rezagholizadeh, Osmar Zaiane, Boxing Chen
- ONSEP: A Novel Online Neural-Symbolic Framework for Event Prediction Based on Large Language Model
Xuanqing Yu, Wangtao Sun, Jingwei Li, Kang Liu, Chengbao Liu, Jie Tan
- Speech-based Slot Filling using Large Language Models
Guangzhi Sun, Shutong Feng, Dongcheng Jiang, Chao Zhang, Milica Gasic, Phil Woodland
- Too Big to Fail: Larger Language Models are Disproportionately Resilient to Induction of Dementia-Related Linguistic Anomalies
Changye Li, Zhecheng Sheng, Trevor Cohen, Serguei V. S. Pakhomov
- TRAM: Benchmarking Temporal Reasoning for Large Language Models
Yuqing Wang, Yun Zhao
- Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models
Alfonso Amayuelas, Kyle Wong, Liangming Pan, Wenhu Chen, William Yang Wang
- Exploring Defeasibility in Causal Reasoning
Shaobo Cui, Lazar Milikic, Yiyang Feng, Mete Ismayilzada, Debjit Paul, Antoine Bosselut, Boi Faltings
- Better Synthetic Data by Retrieving and Transforming Existing Datasets
Saumya Sandipkumar Gandhi, Ritu Gala, Vijay Viswanathan, Tongshuang Wu, Graham Neubig
- Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language Models
Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He
- Perspective Taking through Generating Responses to Conflict Situations
Joan Plepi, Charles Welch, Lucie Flek
- LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Nicholas Lee, Thanakul Wattanawong, Sehoon Kim, Karttikeya Mangalam, Sheng Shen, Gopala Anumanchipalli, Michael W. Mahoney, Kurt Keutzer, Amir Gholami
- The Power of Summary-Source Alignments
Ori Ernst, Ori Shapira, Aviv Slobodkin, Sharon Adar, Mohit Bansal, Jacob Goldberger, Ran Levy, Ido Dagan
- An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models
Gantavya Bhatt, Yifang Chen, Arnav Mohanty Das, Jifan Zhang, Sang T. Truong, Stephen Mussmann, Yinglun Zhu, Jeff Bilmes, Simon Shaolei Du, Kevin Jamieson, Jordan T. Ash, Robert D Nowak
- Learning Multimodal Contrast with Cross-modal Memory and Reinforced Contrast Recognition
Yuanhe Tian, Fei Xia, Yan Song
- Text Simplification via Adaptive Teaching
Seyed Ali Bahrainian, Jonathan Dou, Carsten Eickhoff
- A multi-level multi-label text classification dataset of 19th century Ottoman and Russian literary and critical texts
Gokcen Gokceoglu, Devrim Çavuşoğlu, Emre Akbas, Özen Nergis Dolcerocca
- Whose Emotions and Moral Sentiments do Language Models Reflect?
Zihao He, Siyi Guo, Ashwin Rao, Kristina Lerman
- LLM can Achieve Self-Regulation via Hyperparameter Aware Generation
Siyin Wang, Shimin Li, Tianxiang Sun, Jinlan Fu, Qinyuan Cheng, Jiasheng Ye, Junjie Ye, Xipeng Qiu, Xuanjing Huang
- Forward-Backward Reasoning in Large Language Models for Mathematical Verification
Weisen Jiang, Han Shi, Longhui Yu, Zhengying Liu, Yu Zhang, Zhenguo Li, James Kwok
- Towards Uncertainty-Aware Language Agent
Jiuzhou Han, Wray Buntine, Ehsan Shareghi
- Detection and Positive Reconstruction of Cognitive Distortion Sentences: Mandarin Dataset and Evaluation
Shuya Lin, Yuxiong Wang, Jonathan Dong, Shiguang NI
- PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMs
Jiuzhou Han, Nigel Collier, Wray Buntine, Ehsan Shareghi
- Two-stage Generative Question Answering on Temporal Knowledge Graph Using Large Language Models
Yifu Gao, Linbo Qiao, Zhigang Kan, Zhihua Wen, Yongquan He, Dongsheng Li
- VISREAS: Complex Visual Reasoning with Unanswerable Questions
Syeda Nahida Akter, Sangwu Lee, Yingshan Chang, Yonatan Bisk, Eric Nyberg
- A Unified Generative Framework for Bilingual Euphemism Detection and Identification
Yuxue Hu, Junsong Li, Tongguan Wang, Dongyu Su, Guixin Su, Ying Sha
- StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing
Gaoxiang Cong, Yuankai Qi, Liang Li, Amin Beheshti, Zhedong Zhang, Anton van den Hengel, Ming-Hsuan Yang, Chenggang Yan, Qingming Huang
- ETAS: Zero-Shot Transformer Architecture Search via Network Trainability and Expressivity
Jiechao Yang, Yong Liu
- Reasoning Like a Doctor: Improving Medical Dialogue Systems via Diagnostic Reasoning Process Alignment
Kaishuai Xu, Yi Cheng, Wenjun Hou, Qiaoyu Tan, Wenjie Li
- ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
Yanan Wu, Jie Liu, Xingyuan Bu, Jiaheng Liu, Zhanhui Zhou, Yuanxing Zhang, Chenchen Zhang, ZhiqiBai, Haibin Chen, Tiezheng Ge, Wanli Ouyang, Wenbo Su, Bo Zheng
- REInstruct: Building Instruction Data from Unlabeled Corpus
Shu Chen, Xinyan Guan, Yaojie Lu, Hongyu Lin, Xianpei Han, Le Sun
- Learning to Maximize Mutual Information for Chain-of-Thought Distillation
Xin Chen, Hanxian Huang, Yanjun Gao, Yi Wang, Jishen Zhao, Ke Ding
- PEMT: Multi-Task Correlation Guided Mixture-of-Experts Enables Parameter-Efficient Transfer Learning
Zhisheng Lin, Han Fu, Chenghao Liu, Zhuo Li, Jianling Sun
- MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark
Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen
- Identifying Semantic Induction Heads to Understand In-Context Learning
Jie Ren, Qipeng Guo, Hang Yan, Dongrui Liu, Quanshi Zhang, Xipeng Qiu, Dahua Lin
- Chinese Spelling Corrector Is Just a Language Learner
Lai Jiang, Hongqiu Wu, hai zhao, Min Zhang
- Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models
Junfei Wu, Qiang Liu, Ding Wang, Jinghao Zhang, Shu Wu, Liang Wang, Tieniu Tan
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models
Xi Chen, Songyang Zhang, Qibing Bai, Kai Chen, Satoshi Nakamura
- Plan, Generate and Complicate: Improving Low-resource Dialogue State Tracking via Easy-to-Difficult Zero-shot Data Augmentation
Ming Gu, Yan Yang
- DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling
Shanghaoran Quan
- Comments as Natural Logic Pivots: Improve Code Generation via Comment Perspective
Yijie Chen, Yijin Liu, Fandong Meng, Yufeng Chen, Jinan Xu, Jie Zhou
- Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration
Sunhao Dai, Weihao Liu, Yuqi Zhou, Liang Pang, Rongju Ruan, Gang Wang, Zhenhua Dong, Jun Xu, Ji-Rong Wen
- Continual Dialogue State Tracking via Reason-of-Select Distillation
Yujie Feng, Bo LIU, Xiaoyu DONG, ZEXIN LU, Li-Ming Zhan, Xiao-Ming Wu, Albert Y.S. Lam
- Spotting AI’s Touch: Identifying LLM-Paraphrased Spans in Text
Yafu Li, Zhilin Wang, Leyang Cui, Wei Bi, Shuming Shi, Yue Zhang
- SoFA: Shielded On-the-fly Alignment via Priority Rule Following
Xinyu Lu, Bowen Yu, Yaojie Lu, Hongyu Lin, Haiyang Yu, Le Sun, Xianpei Han, Yongbin Li
- Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning
Lukas Christ, Shahin Amiriparian, Manuel Milling, Ilhan Aslan, Björn Schuller
- RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter
Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li
- Benchmarking and Improving Long-Text Translation with Large Language Models
Longyue Wang, Zefeng Du, Wenxiang Jiao, Chenyang Lyu, Jianhui Pang, Leyang Cui, Kaiqiang Song, Derek F. Wong, Shuming Shi, Zhaopeng Tu
- Personalized Topic Selection Model for Topic-Grounded Dialogue
Shixuan Fan, Wei Wei, Xiaofei Wen, Xian-Ling Mao, Jixiong Chen, Dangyang Chen
- Debiasing In-Context Learning by Instructing LLMs How to Follow Demonstrations
Lvxue Li, Jiaqi Chen, Xinyu Lu, Yaojie Lu, Hongyu Lin, Shuheng Zhou, Huijia Zhu, Weiqiang Wang, Zhongyi Liu, Xianpei Han, Le Sun
- Comparing Data Augmentation Methods for End-to-End Task-Oriented Dialog Systems
Christos Vlachos, Themos Stafylakis, Ion Androutsopoulos
- MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Production
Jian Ma, Wenguan Wang, Yi Yang, Feng Zheng
- BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models
Xueliang Zhao, Xinting Huang, Tingchen Fu, Qintong Li, Shansan Gong, Lemao Liu, Wei Bi, Lingpeng Kong
- PartialFormer: Modeling Part Instead of Whole for Machine Translation
Tong Zheng, Bei Li, Huiwen Bao, Jiale Wang, Weiqiao Shan, Tong Xiao, JingBo Zhu
- PACE: Improving Prompt with Actor-Critic Editing for Large Language Model
Yihong Dong, Kangcheng Luo, Xue Jiang, Zhi Jin, Ge Li
- Penetrative AI: Making LLMs Comprehend the Physical World
Huatao Xu, Liying Han, Qirui Yang, Mo Li, Mani Srivastava
- The Impact of Demonstrations on Multilingual In-Context Learning: A Multidimensional Analysis
Miaoran Zhang, Vagrant Gautam, Mingyang Wang, Jesujoba Oluwadara Alabi, Xiaoyu Shen, Dietrich Klakow, Marius Mosbach
- Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell Checking
Ming Dong, Yujing Chen, Zhang Miao, Hao Sun, Tingting He
- An Empirical Study of In-context Learning in LLMs for Machine Translation
Pranjal A Chitale, Jay Gala, Raj Dabre
- ODA: Observation-Driven Agent for integrating LLMs and Knowledge Graphs
Lei Sun, Zhengwei Tao, Youdi Li, Hiroshi Arakawa
- A Comprehensive Study of Jailbreak Attack versus Defense for Large Language Models
Zihao Xu, Yi Liu, Gelei Deng, Yuekang Li, Stjepan Picek
- A Data-Driven Guided Decoding Mechanism for Diagnostic Captioning
Panagiotis Kaliosis, John Pavlopoulos, Foivos Charalampakos, Georgios Moschovis, Ion Androutsopoulos
- Balancing Speciality and Versatility: a Coarse to Fine Framework for Supervised Fine-tuning Large Language Model
Hengyuan Zhang, Yanru Wu, Dawei Li, Ziqing Yang, Rui Zhao, Yong Jiang, Fei Tan
- A Two-Agent Game for Zero-shot Relation Triplet Extraction
Ting Xu, Haiqin Yang, Fei Zhao, Zhen Wu, Xinyu Dai
- Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
Naibin Gu, Peng Fu, Xiyu Liu, Bowen Shen, Zheng Lin, Weiping Wang
- Trust in Internal or External Knowledge? Generative Multi-Modal Entity Linking with Knowledge Retriever
Xinwei Long, Jiali Zeng, Fandong Meng, Jie Zhou, Bowen Zhou
- A Semantic Distance Metric Learning approach for Lexical Semantic Change Detection
Taichi Aida, Danushka Bollegala
- What Have We Achieved on Non-autoregressive Translation?
Yafu Li, Huajian Zhang, Jianhao Yan, Yongjing Yin, Yue Zhang
- Large Language Models Fall Short: Understanding Complex Relationships in Detective Narratives
Runcong Zhao, Qinglin Zhu, Hainiu Xu, Jiazheng Li, Yuxiang Zhou, Yulan He, Lin Gui
- DistillMIKE: Editing Distillation of Massive In-Context Knowledge Editing in Large Language Models
Shanbao Qiao, Xuebing Liu, Seung-Hoon Na
- Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding
Heming Xia, Zhe Yang, Qingxiu Dong, Peiyi Wang, Yongqi Li, Tao Ge, Tianyu Liu, Wenjie Li, Zhifang Sui
- Hierarchy-aware Biased Bound Margin Loss Function for Hierarchical Text Classification
Gibaeg Kim, SangHun Im, Heung-Seon Oh
- Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts
Zhuo Chen, Xinyu Wang, Yong Jiang, Pengjun Xie, Fei Huang, Kewei Tu
- CICLe: Conformal In-Context Learning for Largescale Multi-Class Food Risk Classification
Korbinian Randl, John Pavlopoulos, Aron Henriksson, Tony Lindgren
- IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact
Ruikang Liu, Haoli Bai, Haokun Lin, Yuening Li, Han Gao, Zhengzhuo Xu, Lu Hou, Jun Yao, Chun Yuan
- Learning Adverbs with Spectral Mixture Kernels
Tomoe Taniguchi, Daichi Mochihashi, Ichiro Kobayashi
- E-EVAL: A Comprehensive Chinese K-12 Education Evaluation Benchmark for Large Language Models
Jinchang Hou, Chang Ao, Haihong Wu, Xiangtao Kong, Zhigang Zheng, Daijia Tang, Chengming Li, Xiping Hu, Ruifeng Xu, Shiwen Ni, Min Yang
- ChartAssistant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning
Fanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo
- Teaching Small Language Models to Reason for Knowledge-Intensive Multi-Hop Question Answering
Xiang Li, Shizhu He, Fangyu Lei, JunYang, Tianhuang Su, Kang Liu, Jun Zhao
- ALaRM: Align Language Models via Hierarchical Rewards Modeling
Yuhang Lai, Siyuan Wang, Shujun Liu, Xuanjing Huang, zhongyu wei
- Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language Models
Zhenyi Lu, Jie Tian, Wei Wei, Xiaoye Qu, Yu Cheng, Wenfeng xie, Dangyang Chen
- UOR: Universal Backdoor Attacks on Pre-trained Language Models
Wei Du, Peixuan Li, Haodong Zhao, Tianjie Ju, Ge Ren, Gongshen Liu
- Language models emulate certain cognitive profiles: An investigation of how predictability measures interact with individual differences
Patrick Haller, Lena Sophia Bolliger, Lena Ann Jäger
- NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Queries
Shudan Zhang, Hanlin Zhao, Xiao Liu, Qinkai Zheng, Zehan Qi, Xiaotao Gu, Yuxiao Dong, Jie Tang
- LLMCrit: Teaching Large Language Models to Use Criteria
Weizhe Yuan, Pengfei Liu, Matthias Gallé
- Empowering cross-lingual abilities of instruction-tuned large language models by translation-following demonstrations
Leonardo Ranaldi, Giulia Pucci, Andre Freitas
- Ranking Entities along Conceptual Space Dimensions with LLMs: An Analysis of Fine-Tuning Strategies
Nitesh Kumar, Usashi Chatterjee, Steven Schockaert
- Efficient $k$-Nearest-Neighbor Machine Translation with Dynamic Retrieval
Yan Gao, Zhiwei Cao, Zhongjian Miao, Baosong Yang, Shiyu Liu, Min Zhang, Jinsong Su
- Symmetric Dot-Product Attention for Efficient Training of BERT Language Models
Martin Courtois, Malte Ostendorff, Leonhard Hennig, Georg Rehm
- Synthesizing Conversations from Unlabeled Documents using Automatic Response Segmentation
Fanyou Wu, Weijie Xu, Chandan K. Reddy, Srinivasan H. Sengamedu
- Can Large Language Models Follow Concept Annotation Guidelines? A Case Study on Scientific and Financial Domains
Marcio Fonseca, Shay B Cohen
- Alignment-Based Decoding Policy for Low-Latency and Anticipation-Free Neural Japanese Input Method Editors
Armin Sarhangzadeh, Taro Watanabe
- ECoK: Emotional Commonsense Knowledge Graph for Mining Emotional Gold
Zhunheng Wang, Xiaoyi Liu, Mengting Hu, Rui Ying, Ming Jiang, Jianfeng Wu, Yalan Xie, Hang Gao, Renhong Cheng
- Deterministic Reversible Data Augmentation for Neural Machine Translation
Jiashu Yao, Heyan Huang, Zeming Liu, Yuhang Guo
- Latent Learningscape Guided In-context Learning
Anlai Zhou, Sunshine Jiang, Yifei Liu, Yiquan Wu, Kun Kuang, Jun Xiao
- SMR: State Memory Replay for Long Sequence Modeling
Biqing Qi, Junqi Gao, Kaiyan Zhang, Dong Li, Jianxing Liu, Ligang Wu, Bowen Zhou
- Characterizing Large Language Models as Rationalizers of Knowledge-intensive Tasks
Aditi Mishra, Sajjadur Rahman, Kushan Mitra, Hannah Kim, Estevam Hruschka
- Challenging Large Language Models with New Tasks: A Study on their Adaptability and Robustness
CHENXI LI, Yuanhe Tian, Zhaxi Zerong, Yan Song, Fei Xia
- LLMs Beyond English: Scaling the Multilingual Capability of LLMs with Cross-Lingual Feedback
Wen Lai, Mohsen Mesgar, Alexander Fraser
- BASS: Batched Attention-optimized Speculative Sampling
Haifeng Qian, Sujan Kumar Gonugondla, Sungsoo Ha, Mingyue Shang, Sanjay Krishna Gouda, Ramesh Nallapati, Sudipta Sengupta, Xiaofei Ma, Anoop Deoras
- Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games
Dekun Wu, Haochen Shi, Zhiyuan Sun, Bang Liu
- It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension
Sagi Shaier, Lawrence Hunter, Katharina von der Wense
- Large Language Models Relearn Removed Concepts
Michelle Wai Man Lo, Fazl Barez, Shay B Cohen
- Towards Unified Task Embeddings Across Multiple Models: Bridging the Gap for Prompt-Based Large Language Models and Beyond
Xinyu Wang, Hainiu Xu, Lin Gui, Yulan He
- TOAD: Task-Oriented Automatic Dialogs with Diverse Response Styles
Yinhong Liu, Yimai Fang, David Vandyke, Nigel Collier
- Machine-Generated Text Localization
Zhongping Zhang, Wenda Qin, Bryan A. Plummer
- BenchIE^FL: A Manually Re-Annotated Fact-Based Open Information Extraction Benchmark
Fabrice Lamarche, Philippe Langlais
- CausalCite: A Causal Formulation of Paper Citations
Ishan Kumar Agrawal, Zhijing Jin, Ehsan Mokhtarian, Siyuan Guo, Yuen Chen, Mrinmaya Sachan, Bernhard Schölkopf
- Question Translation Training for Better Multilingual Reasoning
Wenhao Zhu, Shujian Huang, Fei Yuan, Shuaijie She, Jiajun Chen, Alexandra Birch
- Improving LLM Generations via Fine-Grained Self-Endorsement
Ante Wang, Linfeng Song, Baolin Peng, Lifeng Jin, Ye Tian, Haitao Mi, Jinsong Su, Dong Yu
- Multi-Label Classification for Implicit Discourse Relation Recognition
Wanqiu Long, Siddharth N, Bonnie Webber
- StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code
Hannah McLean Babe, Sydney Nguyen, Yangtian Zi, Arjun Guha, Molly Q Feldman, Carolyn Jane Anderson
- ProLex: A Benchmark for Language Proficiency-oriented Lexical Substitution
Xuanming Zhang, Zixun Chen, Zhou Yu
- Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding
Yuu Jinnai, Ukyo Honda, Tetsuro Morimura, Peinan Zhang
- GATE X-E : A Challenge Set for Gender-Fair Translations from Weakly-Gendered Languages
Spencer Rarrick, Ranjita Naik, Sundar Poudel, Vishal Chowdhary
- Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding
Yuu Jinnai, Kaito Ariu
- Simplifying Translations for Children: Iterative Simplification Considering Age of Acquisition with LLMs
Masashi Oshika, Makoto Morishita, Tsutomu Hirao, Ryohei Sasano, Koichi Takeda
- Bi-Chainer: Automated Large Language Models Reasoning with Bidirectional Chaining
Shuqi LIU, Bowei He, Linqi Song
- Can Large Language Model Summarizers Adapt to Diverse Scientific Communication Goals?
Marcio Fonseca, Shay B Cohen
- Knowledge Context Modeling with Pre-trained Language Models for Contrastive Knowledge Graph Completion
Guangqian Yang, Yi Liu, Lei Zhang, Licheng Zhang, Hongtao Xie, Zhendong Mao
- Stronger, Lighter, Better: Towards Life-Long Attribute Value Extraction for E-Commerce Products
TAO ZHANG, Chenwei Zhang, Xian Li, Jingbo Shang, Hoang H Nguyen, Philip S. Yu
- Generalized Category Discovery with Large Language Models in the Loop
Wenbin An, Wenkai Shi, Feng Tian, Haonan Lin, QianYing Wang, Yaqiang Wu, mingxiang cai, Luyan Wang, Yan Chen, Haiping Zhu, Ping Chen
- VAEGPT-Sim: Improving Sentence Representation with Limited Corpus Using Gradually-Denoising VAE
Zhenyi Wang, Haiyan Ning, Qing Ling, Dan Wang
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
Yiduo Guo, Zekai Zhang, Yaobo Liang, Dongyan Zhao, Nan Duan
- Fact-and-Reflection (FaR) Improves Confidence Calibration of Large Language Models
Xinran Zhao, Hongming Zhang, Xiaoman Pan, Wenlin Yao, Dong Yu, Tongshuang Wu, Jianshu Chen
- DB-LLM: Accurate Dual-Binarization for Efficient LLMs
Hong Chen, Chengtao Lv, Liang Ding, Haotong Qin, Xiabin Zhou, Yifu Ding, Xuebo Liu, Min Zhang, Jinyang Guo, Xianglong Liu, Dacheng Tao
- TempCompass: Do Video LLMs Really Understand Videos?
Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, Lei Li, Sishuo Chen, Xu Sun, Lu Hou
- Teaching Large Language Models an Unseen Language on the Fly
Chen Zhang, Xiao Liu, Jiuheng Lin, Yansong Feng
- Error Analysis Prompting Enables Human-Like Translation Evaluation in Large Language Models
Qingyu Lu, Baopu Qiu, Liang Ding, Kanjian Zhang, Tom Kocmi, Dacheng Tao
- DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation
Jiapeng Wang, Chengyu Wang, Tingfeng Cao, Jun Huang, Lianwen Jin
- Rationales for Answers to Simple Math Word Problems Confuse Large Language Models
Yidan Zhang, Mingfeng Xue, Dayiheng Liu, Zhenan He
- ResLoRA: Identity Residual Mapping in Low-Rank Adaption
Shuhua Shi, Shaohan Huang, Minghui Song, Zhoujun Li, Zihan Zhang, Haizhen Huang, Furu Wei, Weiwei Deng, Feng Sun, Qi Zhang
- Towards Objectively Benchmarking Social Intelligence of Language Agents at the Action Level
Chenxu Wang, Bin Dai, Huaping Liu, Baoyuan Wang
- Semantic Role Labeling from Chinese Speech via End-to-End Learning
Huiyao Chen, Xinxin Li, Meishan Zhang, Min Zhang
- MEEL: Multi-Modal Event Evolution Learning
Zhengwei Tao, Zhi Jin, Junqiang Huang, Xiancai Chen, Xiaoying Bai, Yifan Zhang, Chongyang Tao
- LLM-REDIAL: A Large-Scale Dataset for Conversational Recommender Systems Created from User Behaviors with LLMs
Tingting Liang, Chenxin Jin, Lingzhi Wang, Wenqi Fan, Congying Xia, Kai Chen, Yuyu Yin
- Investigating Subtler Biases in LLMs: Ageism, Beauty, Institutional, and Nationality Bias in Generative Models
Mahammed Kamruzzaman, Md. Minul Islam Shovon, Gene Louis Kim
- EVIT: Event-Oriented Instruction Tuning for Event Reasoning
Zhengwei Tao, Xiancai Chen, Zhi Jin, Xiaoying Bai, Haiyan Zhao, Yiwei Lou
- InstructCMP: Length Control in Sentence Compression through Instruction-based Large Language Models
Juseon-Do, Hidetaka Kamigaito, Manabu Okumura, Jingun Kwon
- SymTax: Symbiotic Relationship and Taxonomy Fusion for Effective Citation Recommendation
Karan Goyal, Mayank Goel, Vikram Goyal, Mukesh Mohania
- Assessing News Thumbnail Representativeness: Counterfactual text can enhance the cross-modal matching ability
Yejun Yoon, Seunghyun Yoon, Kunwoo Park
- Towards Better Question Generation in QA-based Event Extraction
Zijin Hong, Jian Liu
- Budget-Constrained Tool Learning with Planning
Yuanhang Zheng, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu
- TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild
Huayang Li, Siheng Li, Deng Cai, Longyue Wang, Lemao Liu, Taro Watanabe, Yujiu Yang, Shuming Shi
- The Critique of Critique
Shichao Sun, Junlong Li, Weizhe Yuan, Ruifeng Yuan, Wenjie Li, Pengfei Liu
- CoCo-Agent: A Comprehensive Cognitive MLLM Agent for Smartphone GUI Automation
Xinbei Ma, Zhuosheng Zhang, hai zhao
- FRVA: Fact-Retrieval and Verification Augmented Entailment Tree Generation for Explainable Question Answering
Yue Fan, Hu zhang, Ru Li, YuJie Wang, Hongye Tan, Jiye Liang
- P4: Plug-and-Play Discrete Prompting for Large Language Models Personalization
Yuansen Zhang, Xiao Wang, Tianze Chen, Jiayi Fu, Tao Gui, Qi Zhang
- RRNorm: A Novel Framework for Chinese Disease Diagnoses Normalization via LLM-Driven Terminology Component Recognition and Reconstruction
Yongqi Fan, yansha zhu, KUI XUE, Jingping Liu, Tong Ruan
- Unexpected Phenomenon: LLMs’ Spurious Associations in Information Extraction
Weiyan Zhang, Wanpeng Lu, Jiacheng Wang, Yating Wang, Lihan Chen, Haiyun Jiang, Jingping Liu, Tong Ruan
- AutoCAP: Towards Automatic Cross-lingual Alignment Planning for Zero-shot Chain-of-Thought
Yongheng Zhang, Qiguang Chen, Min Li, Wanxiang Che, Libo Qin
- LCS: A Language Converter Strategy for Zero-Shot Neural Machine Translation
Zengkui Sun, Yijin Liu, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data
Xiao Liu, Zirui Wu, Xueqing Wu, Pan Lu, Kai-Wei Chang, Yansong Feng
- On the Vulnerability of Safety Alignment in Open-Access LLMs
Jingwei Yi, Rui Ye, Qisi Chen, Bin Benjamin Zhu, Siheng Chen, Defu Lian, Guangzhong Sun, Xing Xie, Fangzhao Wu
- PEK: A Parameter-Efficient Framework for Knowledge-Grounded Dialogue Generation
Pan Yang, Dandan Song, Zhijing Wu, Yanru Zhou
- Outdated Issue Aware Decoding for Factual Knowledge Editing
Zengkui Sun, Yijin Liu, Jiaan Wang, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou
- Disentangling Dialect from Social Bias via Multitask Learning to Improve Fairness
Maximilian Spliethöver, Sai Nikhil Menon, Henning Wachsmuth
- DP-MLM: Differentially Private Text Rewriting Using Masked Language Models
Stephen Meisenbacher, Maulik Chevli, Juraj Vladika, Florian Matthes
- Question-Instructed Visual Descriptions for Zero-Shot Video Answering
David Orlando Romero Mogrovejo, Thamar Solorio
- EX-FEVER: A Dataset for Multi-hop Explainable Fact Verification
Huanhuan Ma, Weizhi Xu, Yifan Wei, Liuji Chen, Liang Wang, Qiang Liu, Shu Wu, Liang Wang
- Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
Zehui Chen, Kuikun Liu, Qiuchen Wang, Wenwei Zhang, Jiangning Liu, Dahua Lin, Kai Chen, Feng Zhao
- Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification
Ekaterina Fadeeva, Aleksandr Rubashevskii, Artem Shelmanov, Sergey Petrakov, Haonan Li, Hamdy Mubarak, Evgenii Tsymbalov, Gleb Kuzmin, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov
- Deciphering the Impact of Pretraining Data on Large Language Models through Machine Unlearning
yang zhao, Li Du, Xiao Ding, Kai Xiong, Zhouhao Sun, Shi jun, Ting Liu, Bing Qin
- Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning
Everlyn Asiko Chimoto, Jay Gala, Orevaoghene Ahia, Julia Kreutzer, Bruce Bassett, Sara Hooker
- What Are You Token About? Differentiable Perturbed Top-$k$ Token Selection for Scientific Document Summarization
Luca Ragazzi, Paolo Italiani, Gianluca Moro, Mattia Panni
- Description Boosting for Zero-Shot Entity and Relation Classification
Gabriele Picco, Leopold Fuchs, Marcos Martínez Galindo, Alberto Purpura, Vanessa López, Hoang Thanh Lam
- Domain-Aware $k$-Nearest-Neighbor Knowledge Distillation for Machine Translation
Zhexuan Wang, Shudong Liu, Xuebo Liu, Miao Zhang, Derek F. Wong, Min Zhang
- Beyond Single-Event Extraction: Towards Efficient Document-Level Multi-Event Argument Extraction
Wanlong Liu, Li Zhou, DingYi Zeng, Yichen Xiao, Shaohuan Cheng, Chen Zhang, Grandee Lee, Malu Zhang, Wenyu Chen
- Revisiting Interpolation Augmentation for Speech-to-Text Generation
Chen Xu, Jie Wang, Xiaoqian Liu, Qian qian Dong, Chunliang Zhang, Tong Xiao, JingBo Zhu, Dapeng Man, Wu Yang
- Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Dennis Thomas Ulmer, Elman Mansimov, Kaixiang Lin, Lijia Sun, Xibin Gao, Yi Zhang
- Semantic are Beacons: A Semantic Perspective for Unveiling Parameter-Efficient Fine-Tuning in Knowledge Learning
Renzhi Wang, Piji Li
- Leveraging Collection-Wide Similarities for Unsupervised Document Structure Extraction
Gili Lior, Yoav Goldberg, Gabriel Stanovsky
- Enhancing Cross Text-Molecule Learning by Self-Augmentation
Jiang Yinuo, Xiang Zhuang, Keyan Ding, Qiang Zhang, Huajun Chen
- RePALM: Popular Quote Tweet Generation via Auto-Response Augmentation
Erxin Yu, Jing Li, Chunpu Xu
- On the Effect of (Near) Duplicate Subwords in Language Modelling
Anton Schäfer, Thomas Hofmann, Imanol Schlag, Tiago Pimentel
- Do Pre-Trained Language Models Detect and Understand Semantic Underspecification? Ask the DUST!
Frank Wildenburg, Michael Hanna, Sandro Pezzelle
- Visual Hallucinations of Multi-modal Large Language Models
Wen Huang, Hongbin Liu, Minxin Guo, Neil Zhenqiang Gong
- SumSurvey: An Abstractive Dataset of Scientific Survey Papers for Long Document Summarization
Ran Liu, Ming Liu, Min Yu, He Zhang, Jianguo Jiang, Gang Li, Weiqing Huang
- Understanding and Patching Compositional Reasoning in LLMs
Zhaoyi Li, Gangwei Jiang, Hong Xie, Linqi Song, Defu Lian, Ying Wei
- Bilingual Rhetorical Structure Parsing with Large Parallel Annotations
Elena Chistova
- Book2Dial: Generating Teacher Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots
Junling Wang, Jakub Macina, Nico Daheim, Sankalan Pal Chowdhury, Mrinmaya Sachan
- SELP: A Semantically-Driven Approach for Separated and Accurate Class Prototypes in Few-Shot Text Classification
Wenxin Liang, Tingyu Zhang, Han Liu, Feng Zhang
- Automated Focused Feedback Generation for Scientific Writing Assistance
Eric Chamoun, Michael Sejr Schlichtkrull, Andreas Vlachos
- FastGAS: Fast Graph-based Annotation Selection for In-Context Learning
Zihan Chen, Song Wang, Cong Shen, Jundong Li
- Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations
Bowen Shen, Zheng Lin, Daren Zha, Wei Liu, Jian Luan, Bin Wang, Weiping Wang
- Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability
Afra Feyza Akyürek, Ekin Akyürek, Leshem Choshen, Derry Wijaya, Jacob Andreas
- Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion
Ruiqi Li, Rongjie Huang, Yongqi Wang, Zhiqing Hong, Zhou Zhao
- Evaluating Large Language Model Biases in Persona-Steered Generation
Andy Liu, Mona T. Diab, Daniel Fried
- Leveraging Entity Information for Cross-Modality Correlation Learning: The Entity-Guided Multimodal Summarization
Yanghai Zhang, Ye Liu, Shiwei Wu, Kai Zhang, Xukai Liu, Qi Liu, Enhong Chen
- CR-UTP: Certified Robustness against Universal Text Perturbations on Large Language Models
Qian Lou, Xin Liang, Jiaqi Xue, Yancheng Zhang, Rui Xie, Mengxin Zheng
- Recovering document annotations for sentence-level bitext
Rachel Wicks, Matt Post, Philipp Koehn
- MetaPro 2.0: Computational Metaphor Processing on the Effectiveness of Anomalous Language Modeling
Rui Mao, Kai He, Claudia Beth Ong, Qian Liu, Erik Cambria
- Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling
Shenzhi Wang, Chang Liu, Zilong Zheng, Siyuan Qi, Shuo Chen, Qisen Yang, Andrew Zhao, Chaofei Wang, Shiji Song, Gao Huang
- Direct Preference Optimization with an Offset
Afra Amini, Tim Vieira, Ryan Cotterell
- TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation
Xize Cheng, Rongjie Huang, Linjun Li, Zehan Wang, Tao Jin, Aoxiong Yin, Chen Feiyang, Xinyu Duan, Baoxing Huai, Zhou Zhao
- More than Minorities and Majorities: Understanding Multilateral Bias in Language Generation
Jiaxu Zhao, Zijing Shi, Yitong Li, Yulong Pei, Ling Chen, Meng Fang, Mykola Pechenizkiy
- Fair Federated Learning with Biased Vision-Language Models
Huimin Zeng, Zhenrui Yue, Yang Zhang, Lanyu Shang, Dong Wang
- SpeechGuard: Exploring the Adversarial Robustness of Multi-modal Large Language Models
Raghuveer Peri, Sai Muralidhar Jayanthi, Srikanth Ronanki, Anshu Bhatia, Karel Mundnich, Saket Dingliwal, Nilaksh Das, Zejiang Hou, Goeric Huybrechts, Srikanth Vishnubhotla, Daniel Garcia-Romero, Sundararajan Srinivasan, Kyu J. Han, Katrin Kirchhoff
- ACUEval: Fine-grained Hallucination Evaluation and Correction for Abstractive Summarization
David Wan, Koustuv Sinha, Srini Iyer, Asli Celikyilmaz, Mohit Bansal, Ramakanth Pasunuru
- An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models
Xiongtao Zhou, Jie He, Yuhua Ke, Guangyao Zhu, Victor Gutierrez Basulto, Jeff Z. Pan
- PARADISE: Evaluating Implicit Planning Skills of Language Models with Procedural Warnings and Tips Dataset
Arda Uzunoglu, Gözde Gül Şahin, Abdulfattah Safa
- TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation
Gökçe Uludoğan, Zeynep Yirmibeşoğlu Balal, Furkan Akkurt, Meliksah Turker, Onur Gungor, Susan Üsküdarlı
- From Discrimination to Generation: Low-Resource Intent Detection with Language Model Instruction Tuning
Feng Zhang, Wei Chen, Fei Ding, Meng Gao, Tengjiao Wang, Jiahui Yao, Jiabin Zheng
- Efficient Continual Pre-training for Building Domain Specific Large Language Models
Yong Xie, Karan Aggarwal, Aitzaz Ahmad
- Distantly-Supervised Joint Extraction with Noise-Robust Learning
Yufei Li, Xiao Yu, Yanghong Guo, Yanchi Liu, Haifeng Chen, Cong Liu
- LLM Factoscope: Uncovering LLMs’ Factual Discernment through Measuring Inner States
Jinwen He, Yujia Gong, Zijin Lin, Cheng’an Wei, Yue Zhao, Kai Chen
- DictLLM: Harnessing Key-Value Data Structures with Large Language Models for Enhanced Medical Diagnostics
YiQiu Guo, Yuchen Yang, Ya Zhang, Yu Wang, Yanfeng Wang
- imapScore: Medical Fact Evaluation Made Easy
Huimin WANG, Yutian Zhao, Xian Wu, Yefeng Zheng
- Making Harmful Behaviors Unlearnable for Large Language Models
Xin Zhou, Yi Lu, Ruotian Ma, Yujian Wei, Tao Gui, Qi Zhang, Xuanjing Huang
- Debiasing Large Language Models with Structured Knowledge
Congda MA, Tianyu Zhao, Manabu Okumura
- Contrastive Instruction Tuning
Tianyi Yan, Fei Wang, James Y. Huang, Wenxuan Zhou, Fan Yin, Aram Galstyan, Wenpeng Yin, Muhao Chen
- Bootstrapped Pre-training with Dynamic Identifier Prediction for Generative Retrieval
Yubao Tang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng
- Refining and Synthesis: A Simple yet Effective Data Augmentation Framework for Cross-Domain Aspect-based Sentiment Analysis
Haining Wang, Kang He, Bobo Li, Lei Chen, Fei Li, Xu Han, Chong Teng, Donghong Ji
- Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Haibin Wu, Ho-Lam Chung, Yi-Cheng Lin, Yuan-Kuei Wu, Xuanjun Chen, Yu-Chi Pai, Hsiu-Hsuan Wang, Kai-Wei Chang, Alexander H. Liu, Hung-yi Lee
- CACL: Community-Aware Heterogeneous Graph Contrastive Learning for Social Media Bot Detection
Sirry Chen, Shuo Feng, Liang Songsong, Chen-Chen Zong, Jing Li, Piji Li
- Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification
Soumya Sanyal, Tianyi Xiao, Jiacheng Liu, Wenya Wang, Xiang Ren
- ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning
Ahmed Masry, Mehrad Shahmohammadi, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty
- Improving Multilingual Neural Machine Translation by Utilizing Semantic and Linguistic Features
Mengyu Bu, Shuhao Gu, Yang Feng
- Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts
Ganesh Jawahar, Haichuan Yang, Yunyang Xiong, Zechun Liu, Dilin Wang, Fei Sun, Meng Li, Aasish Pappu, Barlas Oguz, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Raghuraman Krishnamoorthi, Vikas Chandra
- SharedCon: Implicit Hate Speech Detection using Shared Semantics
Hyeseon Ahn, Youngwook Kim, Jungin Kim, Yo-Sub Han
- Smaller Language Models are capable of selecting Instruction-Tuning Training Data for Larger Language Models
Dheeraj Mekala, Alex Nguyen, Jingbo Shang
- InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents
Qiusi Zhan, Zhixiang Liang, Zifan Ying, Daniel Kang
- Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tuning
Xiaohu Du, Ming Wen, Jiahao Zhu, Zifan Xie, Bin Ji, Huijun Liu, Xuanhua Shi, Hai Jin
- PPTSER: A Plug-and-Play Tag-guided Method for Few-shot Semantic Entity Recognition on Visually-rich Documents
Wenhui Liao, Jiapeng Wang, Zening Lin, Longfei Xiong, Lianwen Jin
- LLM Performance Predictors are good initializers for Architecture Search
Ganesh Jawahar, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Dujian Ding
- MODDP: A Multi-modal Open-domain Chinese Dataset for Dialogue Discourse Parsing
Chen Gong, DeXin Kong, Suxian Zhao, Xingyu Li, Guohong Fu
- Chinese MentalBERT: Domain-Adaptive Pre-training on Social Media for Chinese Mental Health Text Analysis
Wei Zhai, Hongzhi Qi, Qing Zhao, Jianqiang Li, Ziqi Wang, Han Wang, Bing Xiang Yang, Guanghui FU
- Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
Zhanhui Zhou, Jie Liu, Jing Shao, Xiangyu Yue, Chao Yang, Wanli Ouyang, Yu Qiao
- DORY: Deliberative Prompt Recovery for LLM
Lirong Gao, Ru Peng, Yiming Zhang, Junbo Zhao
- STYLE: Improving Domain Transferability of Asking Clarification Questions in Large Language Model Powered Conversational Agents
Yue Chen, Chen Huang, Yang Deng, Wenqiang Lei, Dingnan Jin, Jia Liu, Tat-Seng Chua
- Evaluating Robustness of Generative Search Engine on Adversarial Factoid Questions
Xuming Hu, Xiaochuan Li, Junzhe Chen, Yinghui Li, Yangning Li, Xiaoguang Li, Yasheng Wang, Qun Liu, Lijie Wen, Philip S. Yu, Zhijiang Guo
- Automatic Engineering of Long Prompts
Cho-Jui Hsieh, Si Si, Felix Yu, Inderjit S Dhillon
- AS-ES Learning: Towards efficient CoT learning in small models
Nuwa Xi, Yuhan Chen, Sendong Zhao, Haochun Wang, GongZhang, Bing Qin, Ting Liu
- II-MMR: Identifying and Improving Multi-modal Multi-hop Reasoning in Visual Question Answering
Jihyung Kil, Farideh Tavazoee, Dongyeop Kang, Joo-Kyung Kim
- TAME-RD: Text Assisted Replication of Image Multi-Adjustments for Reverse Designing
Pooja Guhan, Uttaran Bhattacharya, Somdeb Sarkhel, Vahid Azizi, Xiang Chen, Saayan Mitra, Aniket Bera, Dinesh Manocha
- Batch-ICL: Effective, Efficient, and Order-Agnostic In-Context Learning
Kaiyi Zhang, Ang Lv, Yuhan Chen, Hansen Ha, Tao XU, Rui Yan
- IndicVoices: Towards building an Inclusive Multilingual Speech Dataset for Indian Languages
Tahir Javed, Janki Atul Nawale, Eldho Ittan George, Sakshi Joshi, Kaushal Santosh Bhogale, Deovrat Mehendale, Ishvinder Virender Sethi, Aparna Ananthanarayanan, Hafsah Faquih, Pratiti Palit, Sneha Ravishankar, Saranya Sukumaran, Tripura Panchagnula, Sunjay Murali, Kunal Sharad Gandhi, Ambujavalli R, Manickam K M, C Venkata Vaijayanthi, Krishnan Srinivasa Raghavan Karunganni, Pratyush Kumar, Mitesh M Khapra
- ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models
Kaiwen Zhou, Kwonjoon Lee, Teruhisa Misu, Xin Eric Wang
- Decomposition for Enhancing Attention: Improving LLM-based Text-to-SQL through Workflow Paradigm
Yuanzhen Xie, Xinzhou Jin, Tao Xie, matrixmxlin, Liang Chen, Chenyun Yu, Cheng lei, Chengxiang Zhuo, Bo Hu, Zang Li
- Unveiling Opinion Evolution via Prompting and Diffusion for Short Video Fake News Detection
Linlin Zong, Jiahui Zhou, Wenmin Lin, Xinyue Liu, Xianchao Zhang, Bo Xu
- iSign: A Benchmark for Indian Sign Language Processing
Abhinav Joshi, Romit Mohanty, Mounika Kanakanti, Andesha Mangla, Sudeep Choudhary, Monali Barbate, Ashutosh Modi
- Data Contamination Calibration for Black-box LLMs
Wentao Ye, Jiaqi Hu, Liyao Li, Haobo Wang, Gang Chen, Junbo Zhao
- Truth-Aware Context Selection: Mitigating Hallucinations of Large Language Models Being Misled by Untruthful Contexts
Tian Yu, Shaolei Zhang, Yang Feng
- Efficiently Exploring Large Language Models for Document-Level Machine Translation with In-context Learning
Menglong Cui, Jiangcun Du, shaolin Zhu, Deyi Xiong
- Improving Grammatical Error Correction via Contextual Data Augmentation
Yixuan Wang, Baoxin Wang, Yijun Liu, Qingfu Zhu, Dayong Wu, Wanxiang Che
- RECOST: External Knowledge Guided Data-efficient Instruction Tuning
Qi Zhang, Yiming Zhang, Haobo Wang, Junbo Zhao
- Understanding Cross-Lingual Alignment—A Survey
Katharina Hämmerl, Jindřich Libovický, Alexander Fraser
- Mitigate Negative Transfer with Similarity Heuristic Lifelong Prompt Tuning
Chenyuan Wu, Gangwei Jiang, Defu Lian
- PANDA: Preference Adaptation for Enhancing Domain-Specific Abilities of LLMs
An Liu, Zonghan Yang, Zhenhe Zhang, Qingyuan Hu, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Yang Liu
- Developing PUGG for Polish: A Modern Approach to KBQA, MRC, and IR Dataset Construction
Albert Sawczyn, Katsiaryna Viarenich, Konrad Wojtasik, Aleksandra Domogała, Marcin Oleksy, Maciej Piasecki, Tomasz Jan Kajdanowicz
- Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM
Zijin Hong, Zheng Yuan, Hao Chen, Qinggang Zhang, Feiran Huang, Xiao Huang
- Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration
Han Cheng Yu, Yu An Shih, Kin Man Law, KaiYu Hsieh, Yu Chen Cheng, Hsin Chih Ho, Zih An Lin, WEN-CHUAN HSU, Yao-Chung Fan
- Exploiting Positional Bias for Query-Agnostic Generative Content in Search
Andrew Parry, Sean MacAvaney, Debasis Ganguly
- ICC : Quantifying Image Caption Concreteness for Multimodal Dataset Curation
Moran Yanuka, Morris Alper, Hadar Averbuch-Elor, Raja Giryes
- On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey
Lin Long, Rui Wang, Ruixuan Xiao, Junbo Zhao, Xiao Ding, Gang Chen, Haobo Wang
- Accelerating Multilingual Language Model for Excessively Tokenized Languages
Jimin Hong, Gibbeum Lee, Jaewoong Cho
- Distillation Enhanced Generative Retrieval
Yongqi Li, Zhen Zhang, Wenjie Wang, Liqiang Nie, Wenjie Li, Tat-Seng Chua
- ToxVidLM: A Multimodal Framework for Toxicity Detection in Code-Mixed Videos
Krishanu Maity, A.S. Poornash, Sriparna Saha, Pushpak Bhattacharyya
- StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
Zhicheng Guo, Sijie Cheng, Hao Wang, Shihao Liang, Yujia Qin, Peng Li, Zhiyuan Liu, Maosong Sun, Yang Liu
- Both Matter: Enhancing the Emotional Intelligence of Large Language Models without Compromising the General Intelligence
Weixiang Zhao, Zhuojun Li, Shilong Wang, Yang Wang, Yulin Hu, Yanyan Zhao, Chen Wei, Bing Qin
- KorNAT: LLM Alignment Benchmark for Korean Social Values and Common Knowledge
Jiyoung Lee, Minwoo Kim, Seungho Kim, Junghwan Kim, Seunghyun Won, Hwaran Lee, Edward Choi
- Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development
Pranab Sahoo, Ayush Kumar Singh, Sriparna Saha, Aman Chadha, Samrat Mondal
- Space Decomposition for Sentence Embedding
Wuttikorn Ponwitayarat, Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Sarana Nutanong
- Improving Low-Resource Machine Translation for Formosan Languages Using Bilingual Lexical Resources
Francis Zheng, Edison Marrese-Taylor, Yutaka Matsuo
- CMMLU: Measuring massive multitask language understanding in Chinese
Haonan Li, Yixuan Zhang, Fajri Koto, Yifei Yang, hai zhao, Yeyun Gong, Nan Duan, Timothy Baldwin
- Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation
Seongyun Lee, Seungone Kim, Sue Hyun Park, Geewook Kim, Minjoon Seo
- Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction
Xiaoyuan Li, Wenjie Wang, Moxin Li, Junrong Guo, Yang Zhang, Fuli Feng
- Less is KEN: a Universal and Simple Non-Parametric Pruning Algorithm for Large Language Models
Michele Mastromattei, Fabio Massimo Zanzotto
- When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation
Shiyu Ni, Keping Bi, Jiafeng Guo, Xueqi Cheng
- Hybrid Alignment Training for Large Language Models
Chenglong Wang, Hang Zhou, Kaiyan Chang, Bei Li, Yongyu Mu, Tong Xiao, Tongran Liu, JingBo Zhu
- Graph-Structured Speculative Decoding
Zhuocheng Gong, Jiahao Liu, Ziyue Wang, Pengfei Wu, Jingang Wang, Xunliang Cai, Dongyan Zhao, Rui Yan
- Duwak: Dual Watermarks in Large Language Models
Chaoyi Zhu, Jeroen M. Galjaard, Pin-Yu Chen, Lydia Y. Chen
- CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion
Qibing Ren, Chang Gao, Jing Shao, Junchi Yan, Xin Tan, Wai Lam, Lizhuang Ma
- Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training
Qingyan Guo, Rui Wang, Junliang Guo, Xu Tan, Jiang Bian, Yujiu Yang
- wav2vec-S: Adapting Pre-trained Speech Models for Streaming
Biao Fu, Kai Fan, Minpeng Liao, Yidong Chen, Xiaodong Shi, Zhongqiang Huang
- Peering into the Mind of Language Models: An Approach for Attribution in Contextual Question Answering
Anirudh Phukan, Shwetha S, Apoorv Saxena, Koustava Goswami, Balaji Vasan Srinivasan
- TRAP: Targeted Random Adversarial Prompt Honeypot for Black-Box Identification
Martin Gubri, Dennis Thomas Ulmer, Hwaran Lee, Sangdoo Yun, Seong Joon Oh
- CLASP: Cross-modal Alignment Using Pre-trained Unimodal Models
Jianing Zhou, Ziheng Zeng, Hongyu Gong, Suma Bhat
- TimeToM: Temporal Space is the Key to Unlocking the Door of Large Language Models’ Theory-of-Mind
Guiyang Hou, Wenqi Zhang, Yongliang Shen, Linjuan Wu, Weiming Lu
- Identifying and Mitigating Annotation Bias in Natural Language Understanding using Causal Mediation Analysis
Sitiporn Sae Lim, Can Udomcharoenchaikit, Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Sarana Nutanong
- Perturbed examples reveal invariances shared by language models
Ruchit Rawal, Mariya Toneva
- Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
Yiwei Li, Fei Mi, Yitong Li, Yasheng Wang, Bin Sun, Shaoxiong Feng, Kan Li
- Discourse Structure-Aware Prefix for Generation-Based End-to-End Argumentation Mining
Yang Sun, Guanrong Chen, Caihuayang, Jianzhu Bao, Bin Liang, Xi Zeng, Min Yang, Ruifeng Xu
- Poor-Supervised Evaluation for SuperLLM via Mutual Consistency
Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Boyuan Pan, Heda Wang, Yao Hu, Kan Li
- Addressing Entity Translation Problem via Translation Difficulty and Context Diversity
Tian Liang, Xing Wang, Mingming Yang, Yujiu Yang, Shuming Shi, Zhaopeng Tu
- ADAM: Dense Retrieval Distillation with Adaptive Dark Examples
Chongyang Tao, Chang Liu, Tao Shen, Can Xu, Xiubo Geng, Binxing Jiao, Daxin Jiang
- Instruction Position Matters in Sequence Generation with Large Language Models
Yijin Liu, Xianfeng Zeng, Chenze Shao, Fandong Meng, Jie Zhou
- XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection
Yuanhang Yang, Shiyi Qi, Wenchao Gu, Chaozheng Wang, Cuiyun Gao, Zenglin Xu
- BranchNorm: Robustly Scaling Extremely Deep Transformers
Yijin Liu, Xianfeng Zeng, Fandong Meng, Jie Zhou
- MusTQ: A Temporal Knowledge Graph Question Answering Dataset for Multi-Step Temporal Reasoning
Tingyi Zhang, Jiaan Wang, Zhixu Li, Jianfeng Qu, An Liu, Zhigang Chen, Hongping Zhi
- Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models
Anthony Sicilia, Hyunwoo Kim, Khyathi Chandu, Malihe Alikhani, Jack Hessel
- Knowledge Fusion By Evolving Weights of Language Models
Guodong DU, Jing Li, Hanting Liu, Runhua Jiang, Shuyang Yu, Yifei Guo, Sim Kuan Goh, Ho-Kin Tang
- ScaLearn: Simple and Highly Parameter-Efficient Task Transfer by Learning to Scale
Markus Frohmann, Carolin Holtermann, Shahed Masoudian, Anne Lauscher, Navid Rekabsaz
- Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models
Chang-Sheng Kao, Yun-Nung Chen
- MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization
Zhiyu Yang, Zihan Zhou, Shuo Wang, Xin Cong, Xu Han, Yukun Yan, Zhenghao Liu, Zhixing Tan, Pengyuan Liu, Dong Yu, Zhiyuan Liu, Xiaodong Shi, Maosong Sun
- Continual Few-shot Relation Extraction via Adaptive Gradient Correction and Knowledge Decomposition
hu jianpeng, Chengxiang Tan, JiaCheng Xu, XiangyunKong
- CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models
Linhao Yu, Yongqi Leng, Yufei Huang, Shang Wu, Haixin Liu, Xinmeng Ji, Jiahui Zhao, Jinwang Song, Tingting Cui, Xiaoqing Cheng, Liutao, Deyi Xiong
- Cache & Distil: Optimising API Calls to Large Language Models
Guillem Ramírez, Matthias Lindemann, Alexandra Birch, Ivan Titov
- Investigating the Impact of Model Instability on Explanations and Uncertainty
Sara Vera Marjanovic, Isabelle Augenstein, Christina Lioma
- A Two-Stage Adaptation of Large Language Models for Text Ranking
Longhui Zhang, Yanzhao Zhang, Dingkun Long, Pengjun Xie, Meishan Zhang, Min Zhang
- Fine-tuning with HED-IT: The impact of human post-editing for dialogical language models
Daniela Occhipinti, Michele Marchi, Irene Mondella, Huiyuan Lai, Felice Dell’Orletta, Malvina Nissim, Marco Guerini
- Analyze, Generate and Refine: Query Expansion with LLMs for Zero-Shot Open-Domain QA
Xinran Chen, Xuanang Chen, Ben He, Tengfei Wen, Le Sun
- On the Evaluation of Speech Foundation Models for Spoken Language Understanding
Siddhant Arora, Ankita Pasad, Chung-Ming Chien, Jionghao Han, Roshan Sharma, Jee-weon Jung, Hira Dhamyal, William Chen, Suwon Shon, Hung-yi Lee, Karen Livescu, Shinji Watanabe
- Towards Multiple References Era – Addressing Data Leakage and Limited Reference Diversity in Machine Translation Evaluation
Xianfeng Zeng, Yijin Liu, Fandong Meng, Jie Zhou
- Prompting open-source and commercial language models for grammatical error correction of English learner text
Christopher Davis, Andrew Caines, O Andersen, Shiva Taslimipoor, Helen Yannakoudakis, Zheng Yuan, Christopher Bryant, Marek Rei, Paula Buttery
- BATS: BenchmArking Text Simplicity 🦇
Christin Katharina Kreutz, Fabian Haak, Björn Engelmann, Philipp Schaer
- Discovering influential text using convolutional neural networks
Megan Ayers, Luke Sanford, Margaret Roberts, Eddie Yang
- Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models
Yihong Dong, Xue Jiang, Huanyu Liu, Zhi Jin, Bin Gu, Mengfei Yang, Ge Li
- Efficient Training of Language Models with Compact and Consistent Next Token Distributions
Ashutosh Sathe, Sunita Sarawagi
- Ancient Chinese Glyph Identification Powered by Radical Semantics
Yang Chi, Fausto Giunchiglia, Chuntao Li, Hao Xu
- PUB: A Pragmatics Understanding Benchmark for Assessing LLMs’ Pragmatics Capabilities
Settaluri Lakshmi Sravanthi, Meet Doshi, Pavan Kalyan Tankala, Rudra Murthy, Raj Dabre, Pushpak Bhattacharyya
- EmoTransKG: An Innovative Emotion Knowledge Graph to Reveal Emotion Transformation
Huan Zhao, Xupeng Zha, Zixing Zhang
- How Vocabulary Sharing Facilitates Multilingualism in LLaMA?
Fei Yuan, Shuai Yuan, Zhiyong Wu, Lei Li
- Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model
Runzhe Zhan, Xinyi Yang, Derek F. Wong, Lidia S. Chao, Yue Zhang
- Dual Prompt Tuning based Contrastive Learning for Hierarchical Text Classification
Sishi Xiong, Yu Zhao, Jie Zhang, Li Mengxiang, Zhongjiang He, Xuelong Li, Shuangyong Song
- Probing the Emergence of Cross-lingual Alignment during LLM Training
Hetong Wang, Pasquale Minervini, Edoardo Ponti
- STSPL-SSC: Semi-Supervised Few-Shot Short Text Clustering with Semantic text similarity Optimized Pseudo-Labels
Wenhua Nie, Lin Deng, Chang-Bo Liu, JialingWei, Ruitong Han, Haoran Zheng
- A Comprehensive Evaluation of Quantization Strategies for Large Language Models
Renren Jin, Jiangcun Du, Wuwei Huang, Wei Liu, Jian Luan, Bin Wang, Deyi Xiong
- Exploiting Target Language Data for Neural Machine Translation Beyond Back Translation
Abudurexiti Reheman, yingfeng luo, Junhao Ruan, Chunliang Zhang, Anxiang Ma, Tong Xiao, JingBo Zhu
- Bayesian Prompt Ensembles: Model Uncertainty Estimation for Black-Box Large Language Models
Francesco Tonolini, Nikolaos Aletras, Jordan Massiah, Gabriella Kazai
- X-ACE: Explainable and Multi-factor Audio Captioning Evaluation
Qian Wang, Jia-Chen Gu, Zhen-Hua Ling
- Reasons to Reject? Aligning Language Models with Judgments
Weiwen Xu, Deng Cai, Zhisong Zhang, Wai Lam, Shuming Shi
- Decomposing Argumentative Essay Generation via Dialectical Planning of Complex Reasoning
Yuhang He, Jianzhu Bao, Yang Sun, Bin Liang, Min Yang, Bing Qin, Ruifeng Xu
- Large Language Models are Few-Shot Training Example Generators: A Case Study in Fallacy Recognition
Tariq Alhindi, Smaranda Muresan, Preslav Nakov
- Concept-aware Data Construction Improves In-context Learning of Language Models
Michal Štefánik, Marek Kadlčík, Petr Sojka
- Non-Autoregressive Machine Translation as Constrained HMM
Haoran Li, Zhanming Jie, Wei Lu
- Multi-modal Stance Detection: New Datasets and Model
Bin Liang, Ang Li, Jingqian Zhao, Lin Gui, Min Yang, Yue Yu, Kam-Fai Wong, Ruifeng Xu
- Enhanced Language Model Truthfulness with Learnable Intervention and Uncertainty Expression
Farima Fatahi Bayat, Xin Liu, H. V. Jagadish, Lu Wang
- MM-LLMs: Recent Advances in MultiModal Large Language Models
Duzhen Zhang, Yahan Yu, Jiahua Dong, Chenxing Li, Dan Su, Chenhui Chu, Dong Yu
- CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models
Yizhi LI, Ge Zhang, Xingwei Qu, Jiali Li, ZHAOQUN LI, Noah Wang, Hao Li, Ruibin Yuan, Yinghao Ma, Kai Zhang, Wangchunshu Zhou, Yiming Liang, Lei Zhang, Lei Ma, Jiajun Zhang, Zuowen Li, Wenhao Huang, Chenghua Lin, Jie Fu
- Countering Reward Over-Optimization in LLM with Demonstration-Guided Reinforcement Learning
Mathieu Rita, Florian Strub, Rahma Chaabouni, Paul Michel, Emmanuel Dupoux, Olivier Pietquin
- Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss
Wei He, Marco Idiart, Carolina Scarton, Aline Villavicencio
- AdaLomo: Low-memory Optimization with Adaptive Learning Rate
Kai Lv, Hang Yan, Qipeng Guo, haijun Lv, Xipeng Qiu
- Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks
Wenyue Hua, Jiang Guo, Mingwen Dong, Henghui Zhu, Patrick Ng, Zhiguo Wang
- Exciting Mood Changes: A Time-aware Hierarchical Transformer for Change Detection Modelling
Anthony Hills, Talia Tseriotou, Xenia Miscouridou, Adam Tsakalidis, Maria Liakata
- CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation
Xiwen Liang, Liang Ma, Shanshan Guo, Jianhua Han, Hang Xu, Shikui Ma, Xiaodan Liang
- SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval
Siwei Wu, Yizhi LI, Kang Zhu, Ge Zhang, Yiming Liang, Kaijing Ma, Chenghao Xiao, Haoran Zhang, Bohao Yang, Wenhu Chen, Wenhao Huang, Noura Al Moubayed, Jie Fu, Chenghua Lin
- Learning to Generate Instruction Tuning Datasets for Zero-Shot Task Adaptation
Nihal V. Nayak, Yiyang Nan, Avi Trost, Stephen Bach
- Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning
Anirudh Som, Karan Sikka, Helen Gent, Ajay Divakaran, Andreas Kathol, Dimitra Vergyri
- Paying Attention to Deflections: Mining Pragmatic Nuances for Whataboutism Detection in Online Discourse
Khiem Dinh Phi, Noushin Salek Faramarzi, Chenlu Wang, Ritwik Banerjee
- Epistemology of Language Models: Do Language Models Have Holistic Knowledge?
Minsu Kim, James Thorne
- Strong hallucinations from negation and how to fix them
Swarnadeep Bhar, Nicholas Asher
- LLMs as Narcissistic Evaluators: When Ego Inflates Evaluation Scores
Yiqi Liu, Nafise Sadat Moosavi, Chenghua Lin
- HelloFresh: LLM Evalutions on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits
Tim Franzmeyer, Aleksandar Shtedritski, Samuel Albanie, Philip Torr, Joao F. Henriques, Jakob Nicolaus Foerster
- Chaos with Keywords: Exposing Large Language Models Sycophancy to Misleading Keywords and Evaluating Defense Strategies
Aswin RRV, Nemika Tyagi, Md Nayem Uddin, Neeraj Varshney, Chitta Baral
- Empowering Large Language Models for Textual Data Augmentation
Yichuan Li, Kaize Ding, Jianling Wang, Kyumin Lee
- Choose Your Transformer: Improved Transferability Estimation of Transformer Models on Classification Tasks
Lukas Garbaciauskas, Max Ploner, Alan Akbik
- CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation
I-Hung Hsu, Zifeng Wang, Long Le, Lesly Miculicich, Nanyun Peng, Chen-Yu Lee, Tomas Pfister
- TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction
Kuan-Hao Huang, I-Hung Hsu, Tanmay Parekh, Zhiyu Xie, Zixuan Zhang, Prem Natarajan, Kai-Wei Chang, Nanyun Peng, Heng Ji
- OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Tianyu Zheng, Ge Zhang, Tianhao Shen, Xueling Liu, Bill Yuchen Lin, Jie Fu, Wenhu Chen, Xiang Yue
- Measuring and Addressing Indexical Bias in Information Retrieval
Caleb Ziems, William Barr Held, Jane Dwivedi-Yu, Diyi Yang
- CIDAR: Culturally Relevant Instruction Dataset For Arabic
Zaid Alyafeai, Khalid Almubarak, Ahmed Ashraf, Deema Alnuhait, Saied Alshahrani, Gubran A.Q. Abdulrahman, Gamil Ahmed, Qais Gawah, Zead Saleh, Mustafa Ghaleb, Yousef Ali, Maged S. Al-shaibani
- RadGraph-XL: A Large-Scale Expert-Annotated Dataset for Entity and Relation Extraction from Radiology Reports
Jean-Benoit Delbrouck, Pierre Joseph Marcel Chambon, Zhihong Chen, Maya Varma, Andrew Johnston, Louis Blankemeier, Dave Van Veen, Tan Bui, Steven Truong, Curtis Langlotz
- SMART: Submodular Data Mixture Strategy for Instruction Tuning
H S V N S Kowndinya Renduchintala, Sumit Bhatia, Ganesh Ramakrishnan
- Selective “Selective Prediction”: Reducing Unnecessary Abstention in Vision-Language Reasoning
Tejas Srinivasan, Jack Hessel, Tanmay Gupta, Bill Yuchen Lin, Yejin Choi, Jesse Thomason, Khyathi Chandu
- Differentially Private Knowledge Distillation via Synthetic Text Generation
James Flemings, Murali Annavaram
- KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions
Fangyuan Xu, Kyle Lo, Luca Soldaini, Bailey Kuehl, Eunsol Choi, David Wadden
- XL-HeadTags: Leveraging Multimodal Retrieval Augmentation for the Multilingual Generation of News Headlines and Tags
Faisal Tareque Shohan, Mir Tafseer Nayeem, Samsul Islam, Abu Ubaida Akash, Shafiq Joty
- InFoBench: Evaluating Instruction Following Ability in Large Language Models
Yiwei Qin, Kaiqiang Song, Yebowen Hu, Wenlin Yao, Sangwoo Cho, Xiaoyang Wang, Xuansheng Wu, Fei Liu, Pengfei Liu, Dong Yu
- EcoRank: Budget-Constrained Text Re-ranking Using Large Language Models
Muhammad Shihab Rashid, Jannat Ara Meem, Yue Dong, Vagelis Hristidis
- FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Gagan Bhatia, El Moatez Billah Nagoudi, Hasan Cavusoglu, Muhammad Abdul-Mageed
- Aligning Large Multimodal Models with Factually Augmented RLHF
Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liangyan Gui, Yu-Xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell
- The Art of Defending: A Systematic Evaluation and Analysis of LLM Defense Strategies on Safety and Over-Defensiveness
Neeraj Varshney, Pavel Dolin, Agastya Seth, Chitta Baral
- PAT-Questions: A Self-Updating Benchmark for Present-Anchored Temporal Question-Answering
Jannat Ara Meem, Muhammad Shihab Rashid, Yue Dong, Vagelis Hristidis
- $360^\circ$REA: Towards A Reusable Experience Accumulation with $360^\circ$ Assessment for Multi-Agent System
Shen Gao, Hao Li, Zhengliang Shi, Chengrui Huang, Quan Tu, Shuo Shang, Zhiliang Tian, Minlie Huang
- Extracting Polymer Nanocomposite Samples from Full-Length Documents
Ghazal Khalighinejad, Defne Circi, L. Brinson, Bhuwan Dhingra
- Leveraging LLM Reasoning Enhances Personalized Recommender Systems
Alicia Y. Tsai, Adam Kraft, Long Jin, Chenwei Cai, Anahita Hosseini, Taibai Xu, Zemin Zhang, Lichan Hong, Ed H. Chi, Xinyang Yi
- Toucan: Many-to-Many Translation for 150 African Language Pairs
AbdelRahim A. Elmadany, Ife Adebara, Muhammad Abdul-Mageed
- Evaluating Structural Generalization in Neural Machine Translation
Ryoma Kumon, Daiki Matsuoka, Hitomi Yanaka
- Figuratively Speaking: Authorship Attribution via Multi-Task Figurative Language Modeling
Gregorios A Katsios, Ning Sa, Tomek Strzalkowski
- CHAMP: A Competition-level Dataset for Fine-Grained Analyses of LLMs’ Mathematical Reasoning Capabilities
Yujun Audrey Mao, Yoon Kim, Yilun Zhou
- Improving Machine Translation with Large Language Models: A Preliminary Study with Cooperative Decoding
Jiali Zeng, Fandong Meng, Yongjing Yin, Jie Zhou
- Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition
Yukiya Hono, Koh Mitsuda, Tianyu Zhao, Kentaro Mitsui, Toshiaki Wakatsuki, Kei Sawada
- Proving membership in LLM pretraining data via data watermarks
Johnny Wei, Ryan Yixiang Wang, Robin Jia
- SecFormer: Fast and Accurate Privacy-Preserving Inference for Transformer Models via SMPC
Jinglong Luo, Yehong Zhang, Zhuo Zhang, Jiaqi Zhang, Xin Mu, Hui Wang, Yue Yu, Zenglin Xu
- Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications
Junlin Wang, Tianyi Yang, Roy Xie, Bhuwan Dhingra
- History-Aware Conversational Dense Retrieval
Fengran Mo, Chen Qu, Kelong Mao, Tianyu Zhu, Zhan Su, Kaiyu Huang, Jian-Yun Nie
- Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models
Yikai Zhang, Qianyu He, Xintao Wang, Siyu Yuan, Jiaqing Liang, Yanghua Xiao
- ZeroStance: Leveraging ChatGPT for Open-Domain Stance Detection via Dataset Generation
Chenye Zhao, Yingjie Li, Cornelia Caragea, Yue Zhang
- Boosting Zero-Shot Crosslingual Performance using LLM-Based Augmentations with Effective Data Selection
Barah Fazili, Ashish Sunil Agrawal, Preethi Jyothi
- Reinforcement Tuning for Detecting Stances and Debunking Rumors Jointly with Large Language Models
Ruichao Yang, Wei Gao, Jing Ma, Hongzhan Lin, Bo Wang
- Exploring the Potential of Dense Information in Multimodal Alignment
Zhiyuan Fan, Zhihong Chen, Benyou Wang
- InstructEval: Instruction-Tuned Text Evaluator from Human Preference
Wenhao Wu, Wei Li, Xinyan Xiao, Jiachen Liu, Sujian Li
- A Curious Case of Searching for the Correlation between Training Data and Adversarial Robustness of Transformer Textual Models
Dang Cao Cuong, Dung D. Le, Thai Le
- InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment
Jianing Wang, Junda Wu, Yupeng Hou, Yao Liu, Ming Gao, Julian McAuley
- RaDA: Retrieval-augmented Web Agent Planning with LLMs
Minsoo Kim, Victor Bursztyn, Eunyee Koh, Shunan Guo, seung-won hwang
- Competition-Level Problems are Effective LLM Evaluators
Yiming Huang, Zhenghao Lin, Xiao Liu, Yeyun Gong, Shuai Lu, Fangyu Lei, Yaobo Liang, yelong shen, Chen Lin, Nan Duan, Weizhu Chen
- Large Language Models for Automated Open-domain Scientific Hypotheses Discovery
Zonglin Yang, Xinya Du, JUNXIAN LI, Jie Zheng, Soujanya Poria, Erik Cambria
- GRADUAL: Granularity-aware Dual Prototype Learning for Better Few-Shot Relation Extraction
Zhiming Li, Yuchen Lyu
- Training a Better Chinese Spelling Correction Model via Prior-knowledge Guided Teacher
Chi Wei, shaobin huang, Rongsheng Li, Naiyu Yan, Rui Wang
- The Revolution of Multimodal Large Language Models: A Survey
Davide Caffagni, Federico Cocchi, Luca Barsellotti, Nicholas Moratelli, Sara Sarto, Lorenzo Baraldi, Lorenzo Baraldi, Marcella Cornia, Rita Cucchiara
- OOP: Object-Oriented Programming Evaluation Benchmark for Large Language Models
Shuai Wang, Liang Ding, Li Shen, Yong Luo, Bo Du, Dacheng Tao
- Code Needs Comments: Enhancing Code LLMs with Comment Augmentation
Demin Song, Honglin Guo, Yunhua Zhou, Shuhao Xing, Yudong Wang, Zifan Song, Wenwei Zhang, Qipeng Guo, Hang Yan, Xipeng Qiu, Dahua Lin
- Efficient Domain Adaptation for Non-Autoregressive Machine Translation
WangJie You, Pei Guo, Juntao Li, Kehai Chen, Min Zhang
- Exploring Reversal Mathematical Reasoning Ability for Large Language Models
Pei Guo, WangJie You, Juntao Li, Yan Bowen, Min Zhang
- A Unified Joint Approach with Topological Context Learning and Rule Augmentation for Knowledge Graph Completion
Jingtao Guo, Chunxia Zhang, Lingxi Li, Xiaojun Xue, Zhendong Niu
- FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation
Tu Vu, Mohit Iyyer, Xuezhi Wang, Noah Constant, Jerry Wei, Jason Wei, Chris Tar, Yun-Hsuan Sung, Denny Zhou, Quoc V Le, Thang Luong
- ROSE Doesn’t Do That: Boosting the Safety of Instruction-Tuned Large Language Models with Reverse Prompt Contrastive Decoding
Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao
- CR-LLM: A Dataset and Optimization for Concept Reasoning of Large Language Models
Nianqi Li, Jingping Liu, Sihang Jiang, Haiyun Jiang, Yanghua Xiao, Jiaqing Liang, Zujie Liang, Feng Wei, Jinglei Chen, ZHENGHONG HAO, Bing Han
- DATA-CUBE: Data Curriculum for Instruction-based Sentence Representation Learning
Yingqian Min, Kun Zhou, Dawei Gao, Xin Zhao, He Hu, Yaliang Li
- Combating Label Sparsity in Short Text Topic Modeling via Nearest Neighbor Augmentation
Yang Lin, Xinyu Ma, Xin Gao, Ruiqing Li, Yasha Wang, Xu Chu
- RefuteBench: Evaluating Refuting Instruction-Following for Large Language Models
Jianhao Yan, Yun Luo, Yue Zhang
- Complex Logical Query Answering by Calibrating Knowledge Graph Completion Models
Changyi Xiao, Yixin Cao
- Argument-Based Sentiment Analysis on Forward-Looking Statements
Chin-Yi Lin, Chung-Chi Chen, Hen-Hsen Huang, Hsin-Hsi Chen
- Paying More Attention to Source Context: Mitigating Unfaithful Translations from Large Language Model
Hongbin Zhang, Kehai Chen, Xuefeng Bai, Yang Xiang, Min Zhang
- Unveiling the Power of Integration: Block Diagram Summarization through Local-Global Fusion
Shreyanshu Bhushan, Eun-Soo Jung, Minho Lee
- MultiSQL: A Schema-Integrated Context-Dependent Text2SQL Dataset with Diverse SQL Operations
Chunhui Li, Yifan Wang, Zhen Wu, Zhen Yu, Fei Zhao, Shujian Huang, Xinyu Dai
- Towards Demonstration-Aware Large Language Models for Machine Translation
Chen Li, Meishan Zhang, Xuebo Liu, Zhaocong Li, Derek F. Wong, Min Zhang
- DADA: Distribution-Aware Domain Adaptation of PLMs for Information Retrieval
Dohyeon Lee, Jongyoon Kim, seung-won hwang, Joonsuk Park
- LLMs cannot find reasoning errors, but can correct them given the error location
Gladys Tyen, Hassan Mansoor, Victor Carbune, Peter Chen, Tony Mak
- Investigating the Impact of Data Contamination of Large Language Models in Text-to-SQL translation
Federico Ranaldi, Elena Sofia Ruzzetti, Dario Onorati, Leonardo Ranaldi, Cristina Giannone, Andrea Favalli, Raniero Romagnoli, Fabio Massimo Zanzotto
- ChartCheck: Explainable Fact-Checking over Real-World Chart Images
Mubashara Akhtar, Nikesh Subedi, Vivek Gupta, Sahar Tahmasebi, Oana Cocarascu, Elena Simperl
- CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling
Chenhao Zhang, Renhao Li, Minghuan Tan, Min Yang, Jingwei Zhu, Di Yang, Jiahao Zhao, Guancheng Ye, Chengming Li, Xiping Hu
- Tox-BART: Leveraging Toxicity Attributes for Explanation Generation of Implicit Hate Speech
Neemesh Yadav, Sarah Masud, Vikram Goyal, Md Shad Akhtar, Tanmoy Chakraborty
- TextGenSHAP: Scalable Post-Hoc Explanations in Text Generation with Long Documents
James Enouen, Hootan Nakhost, Sayna Ebrahimi, Sercan O Arik, Yan Liu, Tomas Pfister
- Balanced Data Sampling for Language Model Training with Clustering
Yunfan Shao, Linyang Li, Zhaoye Fei, Hang Yan, Dahua Lin, Xipeng Qiu
- Length Generalization of Causal Transformers without Position Encoding
Jie Wang, Tao Ji, Yuanbin Wu, Hang Yan, Tao Gui, Qi Zhang, Xuanjing Huang, Xiaoling Wang
- Unsupervised Sign Language Translation and Generation
Zhengsheng Guo, Zhiwei He, Wenxiang Jiao, Xing Wang, Rui Wang, Kehai Chen, Zhaopeng Tu, Yong Xu, Min Zhang
- Mitigating Data Scarcity in Semantic Parsing across Languages with the Multilingual Semantic Layer and its Dataset
Abelardo Carlos Martinez Lorenzo, Pere-Lluís Huguet Cabot, Karim Ghonim, Lu Xu, Hee-Soo Choi, Alberte Fernández-Castro, Roberto Navigli
- Efficient Sparse Attention needs Adaptive Token Release
Chaoran zhang, Lixin Zou, Dan Luo, Xiangyang Luo, Zihao Li, Min Tang, Chenliang Li
- Learning Fine-Grained Grounded Citations for Attributed Large Language Models
Lei Huang, Xiaocheng Feng, Weitao Ma, Yuxuan Gu, Weihong Zhong, Xiachong Feng, Weijiang Yu, Weihua Peng, Duyu Tang, Dandan Tu, Bing Qin
- ReLiK: Retrieve and LinK, Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget
Riccardo Orlando, Pere-Lluís Huguet Cabot, Edoardo Barba, Roberto Navigli
- Synergizing Large Language Models and Pre-Trained Smaller Models for Conversational Intent Discovery
Jinggui Liang, Lizi Liao, Hao Fei, Jing Jiang
- FENICE: Factuality Evaluation of summarization based on Natural language Inference and Claim Extraction
Alessandro Scirè, Karim Ghonim, Roberto Navigli
- Looking Right is Sometimes Right: Investigating the Capabilities of Decoder-only LLMs for Sequence Labeling
David Dukić, Jan Snajder
- mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans
Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe
- Dual-Stage Multi-Task Syntax-Oriented Pre-Training for Syntactically Controlled Paraphrase Generation
Hongxu Liu, Xiaojie Wang, Jiashen Sun, Ke Zeng, Wan Guanglu
- Demonstration Augmentation for Zero-shot In-context Learning
Yi Su, Yunpeng Tai, Yixin Ji, Juntao Li, Yan Bowen, Min Zhang
- Pushing the Limits of Zero-shot End-to-End Speech Translation
Ioannis Tsiamas, Gerard I. Gállego, José A.R. Fonollosa, Marta R. Costa-jussà
- NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models
Ancheng Xu, Minghuan Tan, Lei Wang, Min Yang, Ruifeng Xu
- Evaluating Large Language Models for Health-related Queries with Presuppositions
Navreet Kaur, Monojit Choudhury, Danish Pruthi
- Word Sense Linking: Disambiguating Outside the Sandbox
Andrei Stefan Bejgu, Edoardo Barba, Luigi Procopio, Alberte Fernández-Castro, Roberto Navigli
- Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks
Verna Dankers, Ivan Titov
- Towards Multi-Relational Multi-Hop Reasoning over Dense Temporal Knowledge Graphs
Jian Liu, Zihe Liu, Xueqiang LYU, Peng Jin, Jinan Xu
- Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language Models
Weihang Su, Changyue Wang, Qingyao Ai, Yiran HU, Zhijing Wu, Yujia Zhou, Yiqun LIU
- Progressive Tuning: Towards Generic Sentiment Abilities for Large Language Models
Guiyang Hou, Yongliang Shen, Weiming Lu
- Fooling the Textual Fooler via Randomizing Latent Representations
Duy Cao Hoang, Nguyen Hung-Quang, Saurav Manchanda, Minlong Peng, Kok-Seng Wong, Khoa D Doan
- FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models
Kaixin Lan, Tao Fang, Derek F. Wong, Yabo Xu, Lidia S. Chao, Cecilia Guanfang Zhao
- Amanda: Adaptively Modality-Balanced Domain Adaptation for Multimodal Emotion Recognition
Xinxin Zhang, Jun Sun, Simin Hong, Taihao Li
- MedREQAL: Examining Medical Knowledge Recall of Large Language Models via Question Answering
Juraj Vladika, Phillip Schneider, Florian Matthes
- Deepfake Defense: Constructing and Evaluating a Specialized Urdu Deepfake Audio Dataset
Sheza Munir, Wassay Sajjad, Mukeet Raza, Emaan Mujahid Abbas, Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza
- Recognizing Everything from All Modalities at Once: Grounded Multimodal Universal Information Extraction
Meishan Zhang, Hao Fei, Bin Wang, Shengqiong Wu, Yixin Cao, Fei Li, Min Zhang
- Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data
Yanda Li, Chi Zhang, Gang Yu, Wanqi Yang, Zhibin Wang, BIN FU, Guosheng Lin, Chunhua Shen, Ling Chen, Yunchao Wei
- Modeling Overregularization in Children with Small Language Models
Akari Haga, Saku Sugawara, Akiyo Fukatsu, Miyu Oba, Hiroki Ouchi, Taro Watanabe, Yohei Oseki
- Harnessing Large Language Models as Post-hoc Correctors
Zhiqiang Zhong, Kuangyu Zhou, Davide Mottin
- Debatrix: Multi-dimensional Debate Judge with Iterative Chronological Analysis Based on LLM
Jingcong Liang, Rong Ye, Meng Han, Ruofei Lai, Xinyu Zhang, Xuanjing Huang, zhongyu wei
- CycleAlign: Iterative Distillation from Black-box LLM to White-box Models for Better Human Alignment
Jixiang Hong, Quan Tu, Changyu Chen, Gao Xing, Ji Zhang, Rui Yan
- Towards a new research agenda for multimodal enterprise document understanding: What are we missing?
Armineh Nourbakhsh, Sameena Shah, Carolyn Rose
- CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems
Amin Abolghasemi, Zhaochun Ren, Arian Askari, Mohammad Aliannejadi, Maarten de Rijke, Suzan Verberne
- Measuring Retrieval Complexity in Question Answering Systems
Matteo Gabburo, Nicolaas Paul Jedema, Siddhant Garg, Leonardo F. R. Ribeiro, Alessandro Moschitti
- Combining Hierachical VAEs with LLMs for clinically meaningful timeline summarisation in social media
Jiayu Song, Jenny Chim, Adam Tsakalidis, Julia Ive, Dana Atzil-Slonim, Maria Liakata
- PIXAR: Auto-Regressive Language Modeling in Pixel Space
Yintao Tai, Xiyang Liao, Alessandro Suglia, Antonio Vergari
- Sparsity-Accelerated Training for Large Language Models
Da Ma, Lu Chen, Pengyu Wang, Hongshen Xu, Hanqi Li, Liangtai Sun, Su Zhu, Shuai Fan, Kai Yu
- Do Language Models Exhibit Human-like Structural Priming Effects?
Jaap Jumelet, Willem Zuidema, Arabella Sinclair
- RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
Noah Wang, Z.Y. Peng, Haoran Que, Jiaheng Liu, Wangchunshu Zhou, Yuhan Wu, Hongcheng Guo, Ruitong Gan, Zehao Ni, Jian Yang, Man Zhang, Zhaoxiang Zhang, Wanli Ouyang, Ke Xu, Wenhao Huang, Jie Fu, Junran Peng
- LangSuit·E: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments
Zixia Jia, Mengmeng Wang, Baichen Tong, Song-Chun Zhu, Zilong Zheng
- MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models
Divyanshu Aggarwal, Ashutosh Sathe, Ishaan Watts, Sunayana Sitaram
- MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts
Xuxin Cheng, Zhihong Zhu, Xianwei Zhuang, Zhanpeng Chen, Zhiqi Huang, Yuexian Zou
- Multi-Task Transfer Matters During Instruction-Tuning
David Mueller, Mark Dredze, Nicholas Andrews
- What Makes a Good Order of Examples in In-Context Learning
Qi Guo, Leiyu Wang, Yidong Wang, Wei Ye, Shikun Zhang
- BloomVQA: Assessing Hierarchical Multi-modal Comprehension
Yunye Gong, Robik Singh Shrestha, Jared Claypoole, Michael Cogswell, Arijit Ray, Christopher Kanan, Ajay Divakaran
- AttributionBench: How Hard is Automatic Attribution Evaluation?
Yifei Li, Xiang Yue, Zeyi Liao, Huan Sun
- Diffusion Guided Language Modeling
Justin Lovelace, Varsha Kishore, Yiwei Chen, Kilian Q Weinberger
- InstructEd: Soft-Instruction Tuning for Model Editing with Hops
XiaoQi Han, Ru Li, Xiaoli Li, Jiye Liang, Zifang Zhang, Jeff Z. Pan
- TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback
Eunseop Yoon, Hee Suk Yoon, SooHwan Eom, Gunsoo Han, Daniel Wontae Nam, Daejin Jo, Kyoung-Woon On, Mark A. Hasegawa-Johnson, Sungwoong Kim, Chang D. Yoo
- Found in the middle: Calibrating Positional Attention Bias Improves Long Context Utilization
Cheng-Yu Hsieh, Yung-Sung Chuang, Chun-Liang Li, Zifeng Wang, Long Le, Abhishek Kumar, James R. Glass, Alexander Ratner, Chen-Yu Lee, Ranjay Krishna, Tomas Pfister
- S3-DST: Structured Open-Domain Dialogue Segmentation and State Tracking in the Era of LLMs
Sarkar Snigdha Sarathi Das, Chirag Shah, Mengting Wan, Jennifer Neville, Longqi Yang, Reid Andersen, Georg Buscher, Tara Safavi
- Set the Clock: Temporal Alignment of Pretrained Language Models
Bowen Zhao, Zander Brumbaugh, Yizhong Wang, Hannaneh Hajishirzi, Noah A. Smith
- From One to Many: Expanding the Scope of Toxicity Mitigation in Language Models
Beyza Ermis, Luiza Amador Pozzobon, Sara Hooker, Patrick Lewis
- Here’s a Free Lunch: Sanitizing Backdoored Models with Model Merge
Ansh Arora, Xuanli He, Maximilian Mozes, Srinibas Swain, Mark Dras, Qiongkai Xu
- Enhancing Sentence Simplification in Portuguese: Leveraging Paraphrases, Context, and Linguistic Features
ARTHUR MARIANO ROCHA DE AZEVEDO SCALERCIO, Maria José Bocorny Finatto, Aline Paes
- How Far can 100 Samples Go? Unlocking Zero-Shot Translation with Tiny Multi-Parallel Data
Di Wu, Shaomu Tan, Yan Meng, David Stap, Christof Monz
- Toward Reliable Ad-hoc Scientific Information Extraction: A Case Study on Two Materials Dataset
Satanu Ghosh, Neal R Brodnik, Carolina Frey, Collin Holgate, Tresa Pollock, Samantha Daly, Samuel Carton
- Structural Optimization Ambiguity and Simplicity Bias in Unsupervised Neural Grammar Induction
Jinwook Park, Kangil Kim
- LMDX: Language Model-based Document Information Extraction and Localization
Vincent Perot, Kai Kang, Florian Luisier, Guolong Su, Xiaoyu Sun, Ramya Sree Boppana, Zilong Wang, Zifeng Wang, Jiaqi Mu, Hao Zhang, Chen-Yu Lee, Nan Hua
- DBQR-QA: A Question Answering Dataset on a Hybrid of Database Querying and Reasoning
Rungsiman Nararatwong, Chung-Chi Chen, Natthawut Kertkeidkachorn, Hiroya Takamura, Ryutaro Ichise
- NoteChat: A Dataset of Synthetic Patient-Physician Conversations Conditioned on Clinical Notes
Junda Wang, Zonghai Yao, Zhichao Yang, Huixue Zhou, Rumeng Li, Xun Wang, Yucheng XU, hong yu
- Model Editing at Scale leads to Gradual and Catastrophic Forgetting
Akshat Gupta, Anurag Rao, Gopala Anumanchipalli
- 3MVRD: Multimodal Multi-task Multi-teacher Visually-Rich Form Document Understanding
Yihao Ding, Lorenzo Vaiani, Caren Han, Jean Lee, Paolo Garza, Josiah Poon, Luca Cagliero
- Faithful Persona-based Conversational Dataset Generation with Large Language Models
Pegah Jandaghi, Xianghai Sheng, Xinyi Bai, Jay Pujara, Hakim Sidahmed
- Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning
Zhiyang Xu, Chao Feng, Rulin Shao, Trevor Ashby, Ying Shen, Di Jin, Yu Cheng, Qifan Wang, Lifu Huang
- Challenges to Evaluating the Generalization of Coreference Resolution Models: A Measurement Modeling Perspective
Ian Porada, Alexandra Olteanu, Kaheer Suleman, Adam Trischler, Jackie CK Cheung
- SAGA: A Participant-specific Examination of Story Alternatives and Goal Applicability for a Deeper Understanding of Complex Events
Sai P Vallurupalli, Katrin Erk, Francis Ferraro
- SLIDE: A Framework Integrating Small and Large Language Models for Open-Domain Dialogues Evaluation
Kun Zhao, Bohao Yang, Chen Tang, Chenghua Lin, Liang Zhan
- Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction Tuning
Janghoon Han, Changho Lee, Joongbo Shin, Stanley Jungkyu Choi, Honglak Lee, Kyunghoon Bae
- What Makes Language Models Good-enough?
Daiki Asami, Saku Sugawara
- Refining Corpora from a Model Calibration Perspective for Chinese Spelling Correction
Dingyao Yu, Yang An, Wei Ye, xiongfeng xiao, Shaoguang Mao, Tao Ge, Shikun Zhang
- CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples
Jianrui Zhang, Mu Cai, Tengyang Xie, Yong Jae Lee
- Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models
Ran Xu, Hejie Cui, Yue Yu, Xuan Kan, Wenqi Shi, Yuchen Zhuang, May Dongmei Wang, Wei Jin, Joyce C. Ho, Carl Yang
- Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation
Min-Jae Hwang, Ilia Kulikov, Benjamin N Peloquin, Hongyu Gong, Peng-Jen Chen, Ann Lee
- Knowledge-Infused Legal Wisdom: Navigating LLM Consultation through the Lens of Diagnostics and Positive-Unlabeled Reinforcement Learning
Yang Wu, Chenghao Wang, Ece Gumusel, Xiaozhong Liu
- TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection
Hui Liu, Wenya Wang, Haoru Li, Haoliang Li
- Tailoring with Targeted Precision: Edit-Based Agents for Open-Domain Procedure Customization
Yash Kumar Lal, Li Zhang, Faeze Brahman, Bodhisattwa Prasad Majumder, Peter Clark, Niket Tandon
- A Meta-Learning Perspective on Transformers for Causal Language Modeling
Xinbo Wu, Lav R. Varshney
- PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs
Rongzhi Zhang, Jiaming Shen, Tianqi Liu, Haorui Wang, Zhen Qin, feng han, Jialu Liu, Simon Baumgartner, Michael Bendersky, Chao Zhang
- Small Language Models Need Strong Verifiers to Self-Correct Reasoning
Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran, Jaekyeom Kim, Moontae Lee, Honglak Lee, Lu Wang
- Hire a Linguist!: Learning Endangered Languages in LLMs with In-Context Linguistic Descriptions
Kexun Zhang, Yee Man Choi, Zhenqiao Song, Taiqi He, William Yang Wang, Lei Li
- From Tarzan to Tolkien: Controlling the Language Proficiency Level of LLMs for Content Generation
Ali Malik, Stephen Mayhew, Christopher J Piech, Klinton Bicknell
- From Representational Harms to Quality-of-Service Harms: A Case Study on Llama 2 Safety Safeguards
Khaoula Chehbouni, Megha Roshan, Emmanuel Ma, Futian Andrew Wei, Afaf Taik, Jackie CK Cheung, Golnoosh Farnadi
- CToolEval: A Chinese Benchmark for LLM-Powered Agent Evaluation in Real-World API Interactions
Zishan Guo, Yufei Huang, Deyi Xiong
- Token Alignment via Character Matching for Subword Completion
Ben Athiwaratkun, Shiqi Wang, Mingyue Shang, YUCHEN TIAN, Zijian Wang, Sujan Kumar Gonugondla, Sanjay Krishna Gouda, Robert Kwiatkowski, Ramesh Nallapati, Parminder Bhatia, Bing Xiang
- emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Ziyang Ma, Zhisheng Zheng, Jiaxin Ye, Jinchao Li, Zhifu Gao, ShiLiang Zhang, Xie Chen
- Language-Informed Beam Search Decoding for Multilingual Machine Translation
Yilin Yang, Stefan Lee, Prasad Tadepalli
- RA-LoRA: Rank-Adaptive Parameter-Efficient Fine-Tuning for Accurate 2-bit Quantized Large Language Models
Minsoo Kim, Sihwa Lee, Wonyong Sung, Jungwook Choi
- The PGNSC Benchmark: How Do We Predict Where Information Spreads?
Alexander K Taylor, Wei Wang
- STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language Models
Shreyas Basavatia, Keerthiram Murugesan, Shivam Ratnakar
- Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models
Dohyun Lee, Daniel Rim, Minseok Choi, Jaegul Choo
- Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding
Xintong Wang, Jingheng Pan, Liang Ding, Chris Biemann
- Fine-tuning Language Models for Joint Rewriting and Completion of Code with Potential Bugs
Dingmin Wang, Jinman Zhao, Hengzhi Pei, Samson Tan, Sheng Zha
- A Critical Study of What Code-LLMs (Do Not) Learn
Abhinav Anand, Shweta Verma, Krishna Narasimhan, Mira Mezini
- Visual In-Context Learning for Large Vision-Language Models
Yucheng Zhou, Xiang Li, Qianning Wang, Jianbing Shen
- SCALE: Synergized Collaboration of Asymmetric Language Translation Engines
Xin Cheng, Xun Wang, Tao Ge, Si-Qing Chen, Furu Wei, Dongyan Zhao, Rui Yan
- No perspective, no perception!! Perspective-aware Healthcare Answer Summarization
Gauri Naik, Sharad Chandakacherla, Shweta Yadav, Md Shad Akhtar
- Retrieval-Augmented Retrieval: Large Language Models are Strong Zero-Shot Retriever
Tao Shen, Guodong Long, Xiubo Geng, Chongyang Tao, Yibin Lei, Tianyi Zhou, Michael Blumenstein, Daxin Jiang
- A Survey on Predicting the Factuality and the Bias of News Media
Preslav Nakov, Jisun An, Haewoon Kwak, Muhammad Arslan Manzoor, Zain Muhammad Mujahid, Husrev Taha Sencar
- Semantic Compression for Word and Sentence Embeddings using Discrete Wavelet Transform
Rana Salama, Abdou Youssef, Mona T. Diab
- Improving Multi-hop Logical Reasoning in Knowledge Graphs with Context-Aware Query Representation Learning
Jeonghoon Kim, Heesoo Jung, Hyeju Jang, Hogun Park
- ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models
Yuzhao Heng, Chunyuan Deng, Yitong Li, Yue Yu, Yinghao Li, Rongzhi Zhang, Chao Zhang
- Defending LLMs against Jailbreaking Attacks via Backtranslation
Yihan Wang, Zhouxing Shi, Andrew Bai, Cho-Jui Hsieh
- A Large Collection of Model-generated Contradictory Responses for Consistency-aware Dialogue Systems
Shiki Sato, Reina Akama, Jun Suzuki, Kentaro Inui
- Exploring Reasoning Biases in Large Language Models Through Syllogism: Insights from the NeuBAROCO Dataset
Kentaro Ozeki, Risako Ando, Takanobu Morishita, Hirohiko Abe, Koji Mineshima, Mitsuhiro Okada
- Unveiling the Spectrum of Data Contamination in Language Model: A Survey from Detection to Remediation
Chunyuan Deng, Yilun Zhao, Yuzhao Heng, Yitong Li, Jiannan Cao, Xiangru Tang, Arman Cohan
- DIMSIM: Distilled Multilingual Critics for Indic Text Simplification
Sneha Mondal, Ritika, Ashish Sunil Agrawal, Preethi Jyothi, Aravindan Raghuveer
- MATTER: Memory-Augmented Transformer Using Heterogeneous Knowledge Sources
Dongkyu Lee, Chandana Satya Prakash, Jack FitzGerald, Jens Lehmann
- Ask LLMs Directly, “What shapes your bias?”: Measuring Social Bias in Large Language Models
Jisu Shin, Hoyun Song, Huije Lee, Soyeong Jeong, Jong C. Park
- Chain-of-History Reasoning for Temporal Knowledge Graph Forecasting
Yuwei Xia, Ding Wang, Qiang Liu, Liang Wang, Shu Wu, Xiao-Yu Zhang
- Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements
Ming Li, Jiuhai Chen, Lichang Chen, Tianyi Zhou
- Label-aware Hard Negative Sampling Strategies with Momentum Contrastive Learning for Implicit Hate Speech Detection
Jaehoon Kim, Seungwan Jin, Sohyun Park, Someen Park, Kyungsik Han
- Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
Ming Li, Lichang Chen, Jiuhai Chen, Shwai He, Jiuxiang Gu, Tianyi Zhou
- Selective Prompting Tuning for Personalized Conversations with LLMs
Qiushi Huang, Xubo Liu, Tom Ko, Bo Wu, Wenwu Wang, Yu Zhang, Lilian Tang
- Sowing the Wind, Reaping the Whirlwind: The Impact of Editing Language Models
Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria
- ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions
Honglin Lin, Siyu Li, Guoshun Nan, Chaoyue Tang, Xueting Wang, Jingxin Xu, Rong Yankai, zhouzhili, Yutong Gao, Qimei Cui, Xiaofeng Tao
- PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual Patterns
Yew Ken Chia, Vernon Toh, Deepanway Ghosal, Lidong Bing, Soujanya Poria
- How Do Moral Emotions Shape Political Participation? A Cross-Cultural Analysis of Online Petitions Using Language Models
Jaehong Kim, Chaeyoon Jeong, Seongchan Park, Meeyoung Cha, Wonjae Lee
- VillagerAgent: A Graph-Based Multi-Agent Framework for Coordinating Complex Task Dependencies in Minecraft
Yubo Dong, Xukun Zhu, Zhengzhe Pan, Linchao Zhu, Yi Yang
- CF-TCIR: A Compositor-Free Framework for Hierarchical Text-Conditioned Image Retrieval
Yuchen Yang, Yu Wang, Yanfeng Wang
- DMIN: A Discourse-specific Multi-granularity Integration Network for Conversational Aspect-based Sentiment Quadruple Analysis
Peijie Huang, Xisheng Xiao, Yuhong Xu, Jiawei Chen
- FragRel: Exploiting Fragment-level Relations in the External Memory of Large Language Models
Xihang Yue, Linchao Zhu, Yi Yang
- On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations
Shiao Meng, Xuming Hu, Aiwei Liu, Fukun Ma, Yawen Yang, Shuang Li, Lijie Wen
- RESEMO: A Benchmark Chinese Dataset for Studying Responsive Emotion from Social Media Content
Bo Hu, Meng Zhang, Chenfei Xie, Yuanhe Tian, Yan Song, Zhendong Mao
- EHR-SeqSQL : A Sequential Text-to-SQL Dataset For Interactively Exploring Electronic Health Records
Jaehee Ryu, Seonhee Cho, Gyubok Lee, Edward Choi
- RePair: Automated Program Repair with Process-based Feedback
Yuze Zhao, Zhenya Huang, Yixiao Ma, Rui Li, Kai Zhang, Hao Jiang, Qi Liu, Linbo Zhu, Yu Su
- Concise and Precise Context Compression for Tool-Using Language Models
Yang Xu, Yunlong Feng, Honglin Mu, Yutai Hou, Yitong Li, Xinghao Wang, Wanjun Zhong, Zhongyang Li, Dandan Tu, Qingfu Zhu, Min Zhang, Wanxiang Che
- MedDec: A Dataset for Extracting Medical Decisions from Discharge Summaries
Mohamed Elgaar, Jiali Cheng, Nidhi Vakil, Hadi Amiri, Leo Anthony Celi
Short Papers
- AFPQ: Asymmetric Floating Point Quantization for LLMs
Yijia Zhang, Sicheng Zhang, Shijie Cao, DaYou Du, Jianyu Wei, Ting Cao, Ningyi Xu
- A Grounded Preference Model for LLM Alignment
Tahira Naseem, Guangxuan Xu, Sarathkrishna Swaminathan, Asaf Yehudai, Subhajit Chaudhury, Radu Florian, Ramón Fernandez Astudillo, Asim Munawar
- How Important is a Language Model for Low-resource ASR?
Zoey Liu, Nitin Venkateswaran, Eric Le Ferrand, Emily Prud’hommeaux
- InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model
Haogeng Liu, Quanzeng You, Yiqi Wang, Xiaotian Han, Bohan Zhai, Yongfei Liu, Wentao Chen, Yiren Jian, Yunzhe Tao, Jianbo Yuan, Ran He, Hongxia Yang
- Effective In-Context Example Selection through Data Compression
ZhongXiang Sun, Kepu Zhang, Haoyu Wang, Xiao Zhang, Jun Xu
- Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data
Haolong Li, Yu Ma, Yinqi Zhang, Chen Ye, Jie chen
- Realistic Evaluation of Toxicity in Large Language Models
Tinh Son Luong, Thanh-Thien Le, Linh Van Ngo, Thien Huu Nguyen
- Learning Job Title Representation from Job Description Aggregation Network
Napat Laosaengpha, Thanit Tativannarat, Chawan Piansaddhayanon, Attapol Rutherford, Ekapol Chuangsuwanich
- Flexible Weight Tuning and Weight Fusion Strategies for Continual Named Entity Recognition
Yahan Yu, Duzhen Zhang, Xiuyi Chen, Chenhui Chu
- An Empirical Study on the Characteristics of Bias upon Context Length Variation for Bangla
Jayanta Sadhu, Ayan Antik Khan, Abhik Bhattacharjee, Rifat Shahriyar
- SPAGHETTI: Open-Domain Question Answering from Heterogeneous Data Sources with Retrieval and Semantic Parsing
Heidi Chenyu Zhang, Sina Semnani, Farhad Ghassemi, Jialiang Xu, Shicheng Liu, Monica Lam
- k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated Text
Abe Bohan Hou, Jingyu Zhang, Yichen Wang, Daniel Khashabi, Tianxing He
- ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation
Jirayu Burapacheep, Ishan Gaur, Agam Bhatia, Tristan Thrush
- Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers
Tuo Zhang, Jinyue Yuan, Salman Avestimehr
- A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism
Brian Thompson, Mehak Preet Dhaliwal, Peter Frisch, Tobias Domhan, Marcello Federico
- RankMean: Module-Level Importance Score for Merging Fine-tuned LLM Models
Gabriel Jacob Perin, Xuxi Chen, Shusen Liu, Bhavya Kailkhura, Zhangyang Wang, Brian Gallagher
- DEBATE: Devil’s Advocate-Based Assessment and Text Evaluation
Alex G. Kim, Keonwoo Kim, Sangwon Yoon
- SocialBench: Sociality Evaluation of Role-Playing Conversational Agents
Hongzhan Chen, Hehong Chen, Ming Yan, Wenshen Xu, Gao Xing, Weizhou Shen, Xiaojun Quan, Chenliang Li, Ji Zhang, Fei Huang
- From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications
Yongqiang Ma, Lizhi Qing, Jiawei Liu, Yangyang Kang, Yue Zhang, Wei Lu, Xiaozhong Liu, Qikai Cheng
- VISPool: Enhancing Transformer Encoders with Vector Visibility Graph Neural Networks
Tuna Alikaşifoğlu, Arda Can Aras, Aykut Koc
- Accurate and Nuanced Open-QA Evaluation Through Textual Entailment
Peiran Yao, Denilson Barbosa
- Dictionary-Aided Translation for Handling Multi-Word Expressions in Low-Resource Languages
Antonios Dimakis, Stella Markantonatou, Antonios Anastasopoulos
- Selective Prefix Tuning for Pre-trained Language Models
Hongyi Zhang, Zuchao Li, Ping Wang, hai zhao
- Towards Better Utilization of Multi-Reference Training Data for Chinese Grammatical Error Correction
Yumeng Liu, Zhenghua Li, HaoChen Jiang, Bo Zhang, Chen Li, Ji Zhang
- Concept-Best-Matching: Evaluating Compositionality In Emergent Communication
Boaz Carmeli, Yonatan Belinkov, Ron Meir
- Pro-Woman, Anti-Man? Identifying Gender Bias in Stance Detection
Yingjie Li, Yue Zhang
- Likelihood-based Mitigation of Evaluation Bias in Large Language Models
Masanari Ohi, Masahiro Kaneko, Ryuto Koike, Mengsay Loem, Naoaki Okazaki
- Aligning Speech Segments Beyond Pure Semantics
Kevin Heffernan, Artyom Kozhevnikov, Loic Barrault, Alexandre Mourachko, Holger Schwenk
- Improving In-Context Learning with Prediction Feedback for Sentiment Analysis
Hongling Xu, Qianlong Wang, Yice Zhang, Min Yang, Xi Zeng, Bing Qin, Ruifeng Xu
- MovieSum: An Abstractive Summarization Dataset for Movie Screenplays
Rohit Saxena, Frank Keller
- Context Length Extension via Generalized Extrapolation Scale
Linhan Li, Zhang Huaping
- Selectively Answering Visual Questions
Julian Martin Eisenschlos, Hernán Maina, Guido Ivetta, Luciana Benotti
- Semantics or spelling? Probing contextual word embeddings with orthographic noise
Jacob A. Matthews, John R Starr, Marten Van Schijndel
- Automated Detection and Analysis of Data Practices Using A Real-World Corpus
Mukund Srinath, Pranav Narayanan Venkit, Maria Badillo, Florian Schaub, C. Lee Giles, Shomir Wilson
- Incorporating Syntax and Lexical Knowledge to Multilingual Sentiment Classification on Large Language Models
Hiroshi Kanayama, YANG ZHAO, Ran Iwamoto, Takuya Ohko
- Which Information Matters? Dissecting Human-written Multi-document Summaries with Partial Information Decomposition
Laura Mascarell, Yan LHomme, Majed El Helou
- Evaluating Large Language Models on Wikipedia-Style Survey Generation
Fan Gao, Hang Jiang, Rui Yang, Qingcheng Zeng, Jinghui Lu, Moritz Blum, Tianwei She, Yuang Jiang, Irene Li
- Predicting Narratives of Climate Obstruction in Social Media Advertising
Harri Rowlands, Gaku Morio, Dylan Tanner, Christopher D Manning
- Model Editing by Standard Fine-Tuning
Govind Krishnan Gangadhar, Karl Stratos
- Mitigating Hallucinations in Large Vision-Language Models (LVLMs) via Language-Contrastive Decoding (LCD)
Avshalom Manevich, Reut Tsarfaty
- HeSum: a Novel Dataset for Abstractive Text Summarization in Hebrew
Tzuf Paz-Argaman, Itai Mondshine, Asaf Achi Mordechai, Reut Tsarfaty
- It is Simple Sometimes: A Study On Improving Aspect-Based Sentiment Analysis Performance
Laura Cabello, Uchenna Akujuobi
- RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering
Zihan Zhang, Meng Fang, Ling Chen
- LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation
Ikuya Yamada, Ryokan Ri
- Do Zombies Understand? A Choose-Your-Own-Adventure Exploration of Machine Cognition
Ariel Goldstein, Gabriel Stanovsky
- Self-Consistent Reasoning-based Aspect-Sentiment Quad Prediction with Extract-Then-Assign Strategy
Jieyong Kim, Ryang Heo, Yongsik Seo, SeongKu Kang, Jinyoung Yeo, Dongha Lee
- “My Answer is C”: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models
Xinpeng Wang, Bolei Ma, Chengzhi Hu, Leon Weber-Genzel, Paul Röttger, Frauke Kreuter, Dirk Hovy, Barbara Plank
- Building Bridges: A Dataset for Evaluating Gender-Fair Machine Translation into German
Manuel Lardelli, Giuseppe Attanasio, Anne Lauscher
- Prompt Chaining or Stepwise Prompt? Refinement in Text Summarization
Shichao Sun, Ruifeng Yuan, Ziqiang Cao, Wenjie Li, Pengfei Liu
- From Zero to Hero: Cold-Start Anomaly Detection
Tal Reiss, George Kour, Naama Zwerdling, Ateret Anaby Tavor, Yedid Hoshen
- LSTPrompt: Large Language Models as Zero-Shot Time Series Forecasters by Long-Short-Term Prompting
haoxin liu, Zhiyuan Zhao, Jindong Wang, Harshavardhan Kamarthi, B. Aditya Prakash
- The State of Relation Extraction Data Quality: Is Bigger Always Better?
Erica Cai, Brendan O’Connor
- Linear Cross-Lingual Mapping of Sentence Embeddings
Oleg Vasilyev, Fumika Isono, John Bohannon
- ULTRA: Unleash LLMs’ Potential for Event Argument Extraction through Hierarchical Modeling and Pair-wise Self-Refinement
Xinliang Frederick Zhang, Carter Blum, Temma Choji, Shalin Shah, Alakananda Vempala
- Exploring Domain Robust Lightweight Reward Models based on Router Mechanism
Hyuk Namgoong, Jeesu Jung, Sangkeun Jung, YoonHyung Roh
- “Get Their Hands Dirty, Not Mine’’: On Researcher-Annotator Collaboration and the Agency of Annotators
Shengqi Zhu, Jeffrey Rzeszotarski
- GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation
Yi Zong, Xipeng Qiu
- Revisiting Parallel Context Windows: A Frustratingly Simple Alternative and Chain-of-Thought Deterioration
Kejuan Yang, Xiao Liu, Kaiwen Men, Aohan Zeng, Yuxiao Dong, Jie Tang
- Large Language Models Can Learn Representation in Natural Language
Yiduo Guo, Yaobo Liang, Dongyan Zhao, Nan Duan
- CTC-based Non-autoregressive Textless Speech-to-Speech Translation
Qingkai Fang, Zhengrui Ma, Yan Zhou, Min zhang, Yang Feng
- Evidence Retrieval is almost All You Need for Fact Verification
Liwen Zheng, Chaozhuo Li, Xi Zhang, Yu-Ming Shang, Feiran Huang, Haoran Jia
- Pushing the Limits of Low-Resource NER Using LLM Artificial Data Generation
Joan Santoso, Patrick Sutanto, Billy Kelvianto Cahyadi, Esther Irawati Setiawan
- Integrating Multi-scale Contextualized Information for Byte-based Neural Machine Translation
Langlin Huang, Yang Feng
- MELD-ST: An Emotion-aware Speech Translation Dataset
Sirou Chen, Sakiko Yahata, Shuichiro Shimizu, Zhengdong Yang, Yihang Li, Chenhui Chu, Sadao Kurohashi
- Designing Informative Metrics for Few-Shot Example Selection
Rishabh Adiga, Lakshmi Subramanian, Varun Chandrasekaran
- Chain-of-Quizzes: Pedagogy-inspired Example Selection in In-Context-Learning
Yiquan Wu, Anlai Zhou, Yuhang Liu, Yifei Liu, Adam Jatowt, Weiming Lu, Jun Xiao, Kun Kuang
- It’s Not Easy Being Wrong: Large Language Models Struggle with Process of Elimination Reasoning
Nishant Balepur, Shramay Palta, Rachel Rudinger
- Centroid-Based Efficient Minimum Bayes Risk Decoding
Hiroyuki Deguchi, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe, Hideki Tanaka, Masao Utiyama
- When is a Language Process a Language Model?
Li Du, Holden Lee, Jason Eisner, Ryan Cotterell
- Definition Generation for Automatically Induced Semantic Frame
Yi Han, Ryohei Sasano, Koichi Takeda
- Don’t Augment, Rewrite? Assessing Abusive Language Detection with Synthetic Data
Camilla Casula, Elisa Leonardelli, Sara Tonelli
- AustroTox: A Dataset for Target-Based Austrian German Offensive Language Detection
Pia Pachinger, Janis Goldzycher, Anna Maria Planitzer, Wojciech Kusa, Allan Hanbury, Julia Neidhardt
- LC4EE: LLMs as Good Corrector for Event Extraction
Mengna Zhu, Kaisheng Zeng, JibingWu, Lihua Liu, Hongbin Huang, Lei Hou, Juanzi Li
- Beyond Text: Leveraging Multi-Task Learning and Cognitive Appraisal Theory for Post-Purchase Intention Analysis
Gerard Christopher Yeo, Shaz Furniturewala, Kokil Jaidka
- Diving Deep into the Motion Representation of Video-Text Models
Chinmaya Devaraj, Cornelia Fermuller, Yiannis Aloimonos
- Argument-Aware Approach To Event Linking
I-Hung Hsu, Zihan Xue, Nilay Pochhi, Sahil Bansal, Prem Natarajan, Jayanth Srinivasa, Nanyun Peng
- Understanding the Impacts of Language Technologies’ Performance Disparities on African American Language Speakers
Jay L. Cunningham, Su Lin Blodgett, Michael A. Madaio, Hal Daumé III, Christina Harrington, Hanna Wallach
- Language Model Priors and Data Augmentation Strategies for Low-resource Machine Translation: A Case Study Using Finnish to Northern Sámi
Jonne Sälevä, Constantine Lignos
- Few-shot Dialogue Strategy Learning for Motivational Interviewing via Inductive Reasoning
Zhouhang Xie, Bodhisattwa Prasad Majumder, Mengjie Zhao, Yoshinori Maeda, Keiichi Yamada, Hiromi Wakaki, Julian McAuley
- Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses
Dongxu Zhang, Varun Prashant Gangal, Barrett Martin Lattimer, Yi Yang
- Referral Augmentation for Zero-Shot Information Retrieval
Michael Tang, Shunyu Yao, John Yang, Karthik R Narasimhan
- Real World Conversational Entity Linking Requires More Than Zero-Shots
Mohanna Hoveyda, Arjen P. de Vries, Faegheh Hasibi, Maarten de Rijke
- Self-Para-Consistency: Improving Reasoning Tasks at Low Cost for Large Language Models
Wenqing Chen, Weicheng Wang, Zhixuan Chu, Kui Ren, Zibin Zheng, Zhichao Lu
- On The Persona-based Summarization of Domain-Specific Documents
Ankan Mullick, Sombit Bose, Rounak Saha, Ayan Kumar Bhowmick, Pawan Goyal, Niloy Ganguly, Prasenjit Dey, Ravi Kokku
- Part-of-speech Tagging for Extremely Low-resource Indian Languages
Sanjeev Kumar, Preethi Jyothi, Pushpak Bhattacharyya
- Leveraging Entailment Judgements in Cross-Lingual Summarisation
Huajian Zhang, Laura Perez-Beltrachini
- Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical Semantics
Zhu Liu, Cunliang Kong, Ying Liu, Maosong Sun
- Preemptive Answer “Attacks” on Chain-of-Thought Reasoning
Rongwu Xu, Zehan Qi, Wei Xu
- Views Are My Own, but Also Yours: Benchmarking Theory of Mind Using Common Ground
Adil Soubki, John Murzaku, Arash Yousefi Jordehi, Peter Zeng, Magdalena Markowska, Seyed Abolghasem Mirroshandel, Owen Rambow
- TAXI: Evaluating Categorical Knowledge Editing for Language Models
Derek Powell, Walter Gerych, Thomas Hartvigsen
- Automatic Bug Detection in LLM-Powered Text-Based Games Using LLMs
Claire Jin, Sudha Rao, XIANGYU PENG, Portia Kwartema Botchway, Jessica Quaye, Chris Brockett, Bill Dolan
- Embodied Language Learning: Opportunities, Challenges, and Future Directions
Nadine Amin, Julia Rayz
- Verifiable Generation with Subsentence-Level Fine-Grained Citations
Shuyang Cao, Lu Wang
- Rethinking Efficient Multilingual Text Summarization Meta-Evaluation
Rilyn R. Han, Jiawen Chen, Yixin Liu, Arman Cohan
- Are Decoder-Only Language Models Better than Encoder-Only Language Models in Understanding Word Meaning?
Muhammad Reza Qorib, Geonsik Moon, Hwee Tou Ng
- KEEP CHATTING! An Attractive Dataset for Continuous Conversation Agents
Yihe Wang, Jin Liu, Yao Wan, Yitong Li, Zifeng Liu, Weipeng Chen