Oral Papers
- InstructCoder: Instruction Tuning Large Language Models for Code Editing
Kaixin Li, Qisheng Hu, James Xu Zhao, Hui Chen, Yuxi Xie, Tiedong Liu, Michael Shieh, Junxian He
- BiasDPO: Mitigating Bias in Language Models through Direct Preference Optimization
Ahmed Allam
- Plot Retrieval as an Assessment of Abstract Semantic Association
Shicheng Xu, Liang Pang, Jiangnan Li, Mo Yu, Fandong Meng, Huawei Shen, Xueqi Cheng, Jie Zhou
- Rescue: Ranking LLM Responses with Partial Ordering to Improve Response Generation
Yikun Wang, Rui Zheng, Haoming Li, Qi Zhang, Tao Gui, Fei Liu
- Curriculum Learning for Small Code Language Models
Marwa Naïr, Kamel Yamani, Lynda Said Lhadj, Riyadh Baghdadi
Poster Papers
- Feriji: A French-Zarma Parallel Corpus, Glossary & Translator
Mamadou K. Keita, Elysabhete Amadou Ibrahim, Habibatou Abdoulaye Alfari, Christopher M Homan
- Pragmatic inference of scalar implicature by LLMs
Ye-eun Cho, Seong mook Kim
- Topic Modeling for Short Texts with Large Language Models
Tomoki Doi, Masaru Isonuma, Hitomi Yanaka
- Can LLMs substitute SQL? Comparing Resource Utilization of Querying LLMs versus Traditional Relational Databases
Xiang Zhang, Khatoon Khedri, Reza Rawassizadeh
- Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer
Yongqi Wang, Bai Jionghao, Rongjie Huang, Ruiqi Li, Zhiqing Hong, Zhou Zhao
- MoExtend: Tuning New Experts for Modality and Task Extension
Shanshan Zhong, Shanghua Gao, Zhongzhan Huang, Wushao Wen, Marinka Zitnik, Pan Zhou
- On the Interpretability of Deep Learning Models for Collaborative Argumentation Analysis in Classrooms
Dliang Wang, Gaowei Chen
- Document Alignment based on Overlapping Fixed-Length Segments
Xiaotian Wang, Takehito Utsuro, Masaaki Nagata
- Automatically Suggesting Diverse Example Sentences for L2 Japanese Learners Using Pre-Trained Language Models
Enrico Benedetti, Akiko Aizawa, Florian Boudin
- Z-coref: Thai Coreference and Zero Pronoun Resolution
Poomphob Suwannapichat, Sansiri Tarnpradab, Santitham Prom-on
- ReMAG-KR: Retrieval and Medically Assisted Generation with Knowledge Reduction for Medical Question Answering
Sidhaarth Murali, Sowmya Kamath, Supreetha R
- Demystifying Instruction Mixing for Fine-tuning Large Language Models
Renxi Wang, Haonan Li, Minghao Wu, Yuxia Wang, Xudong Han, Chiyu Zhang, Timothy Baldwin
- Fine-Tuning ASR models for Very Low-Resource Languages: A Study on Mvskoke
Julia Mainzinger, Gina-Anne Levow
- Automating Qualitative Data Analysis with Large Language Models
Angelina Parfenova, Alexander Denzler, Juergen Pfeffer
- ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection
Janek Herrlein, Chia-Chien Hung, Goran Glavaš
- Label-Aware Automatic Verbalizer for Few-Shot Text Classification in Mid-To-Low Resource Languages
Thanakorn Thaminkaew, Piyawat Lertvittayakumjorn, Peerapon Vateekul
- Vector Spaces for Quantifying Disparity of Multiword Expressions in Annotated Text
Louis Estève, Agata Savary, Thomas Lavergne
- Narratives at Conflict: Computational Analysis of News Framing in Multilingual Disinformation Campaigns
Antonina Sinelnik, Dirk Hovy
- Assessing In-context Learning and Fine-tuning for Topic Classification of German Web Data
Julian Schelb, Andreas Spitz, Roberto Ulloa
- Knowledge Editing of Large Language Models Unconstrained by Word Order
Ryoma Ishigaki, Jundai Suzuki, Masaki Shuzo, Eisaku Maeda
- Exploring the Effectiveness and Consistency of Task Selection in Intermediate-Task Transfer Learning
Pin-Jie Lin, Miaoran Zhang, Marius Mosbach, Dietrich Klakow
- Does the structure of textual content have an impact on language models for automatic summarization?
Eve Sauvage, Sabrina Campano, Lydia Ould-Ouali, Cyril Grouin
- Action Inference for Destination Prediction in Vision-and-Language Navigation
Anirudh Reddy Kondapally, Kentaro Yamada, Hitomi Yanaka
- A Computational Analysis and Exploration of Linguistic Borrowings in French Rap Lyrics
Lucas Zurbuchen, Rob Voigt
- On Improving Repository-Level Code QA for Large Language Models
Jan Strich, Florian Schneider, Irina Nikishina, Chris Biemann
- Compromesso! Italian Many-Shot Jailbreaks undermine the safety of Large Language Models
Fabio Pernisi, Dirk Hovy, Paul Röttger
- Foundation Model for Biomedical Graphs: Integrating Knowledge Graphs and Protein Structures to Large Language Models
Yunsoo Kim
- ViMedAQA: A Vietnamese Medical Abstractive Question-Answering Dataset and Findings of Large Language Model
Minh-Nam Tran, Phu-Vinh Nguyen, Long HB Nguyen, Dien Dinh
- Basreh or Basra? Geoparsing Historical Locations in the Svoboda Diaries
Jolie Zhou, Camille Lyans Cole, Annie T. Chen
- Homophone2Vec: Embedding Space Analysis for Empirical Evaluation of Phonological and Semantic Similarity
Sophie Wu, Anita Zheng, Joey Chuang
- Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition
Tyler McDonald, Ali Emami
- Can LLMs Augment Low-Resource Reading Comprehension Datasets? Opportunities and Challenges
Vinay Samuel, Houda Aynaou, Arijit Ghosh Chowdhury, Karthik Venkat Ramanan, Aman Chadha
- Automatic Derivation of Semantic Representations for Thai Serial Verb Constructions: A Grammar-Based Approach
Vipasha Bansal
- Seed-Free Synthetic Data Generation Framework for Instruction-Tuning LLMs: A Case Study in Thai
Parinthapat Pengpun, Can Udomcharoenchaikit, Weerayut Buaphet, Peerat Limkonchotiwat
- Bridging Distribution Gap via Semantic Rewriting with LLMs to Enhance OOD Robustness
Manas Madine
- CoVoSwitch: Machine Translation of Synthetic Code-Switched Text Based on Intonation Units
Yeeun Kang
- An Analysis under a Unified Formulation of Learning Algorithms with Output Constraints
Mooho Song, Jay-Yoon Lee
- Beyond Abstracts: A New Dataset, Prompt Design Strategy and Method for Biomedical Synthesis Generation
James O’Doherty, Cian Nolan, Yufang Hou, Anya Belz
- Improving Sentence Embeddings with Automatic Generation of Training Data Using Few-shot Examples
Soma Sato, Hayato Tsukagoshi, Ryohei Sasano, Koichi Takeda
- Question-Analysis Prompting Improves LLM Performance in Reasoning Tasks
Dharunish Yugeswardeenoo, Kevin Zhu, Sean O’Brien
- An Individualized News Affective Response Dataset
Tiancheng Hu, Nigel Collier
- How Well Do Vision Models Encode Diagram Attributes?
Haruto Yoshida, Keito Kudo, Yoichi Aoki, Ryota Tanaka, Itsumi Saito, Keisuke Sakaguchi, Kentaro Inui
- CheckersGPT: Learning World Models through Language Modeling
Abhinav Joshi, Vaibhav Sharma, Ashutosh Modi
- In-Context Symbolic Regression: Leveraging Large Language Models for Function Discovery
Matteo Merler, Katsiaryna Haitsiukevich, Nicola Dainese, Pekka Marttinen
- STEP: Staged Parameter-Efficient Pre-training for Large Language Models
Kazuki Yano, Takumi Ito, Jun Suzuki