Accepted SRW Papers

Student Research Workshop Site

Oral Papers

  • InstructCoder: Instruction Tuning Large Language Models for Code Editing
    Kaixin Li, Qisheng Hu, James Xu Zhao, Hui Chen, Yuxi Xie, Tiedong Liu, Michael Shieh, Junxian He
  • BiasDPO: Mitigating Bias in Language Models through Direct Preference Optimization
    Ahmed Allam
  • Plot Retrieval as an Assessment of Abstract Semantic Association
    Shicheng Xu, Liang Pang, Jiangnan Li, Mo Yu, Fandong Meng, Huawei Shen, Xueqi Cheng, Jie Zhou
  • Rescue: Ranking LLM Responses with Partial Ordering to Improve Response Generation
    Yikun Wang, Rui Zheng, Haoming Li, Qi Zhang, Tao Gui, Fei Liu
  • Curriculum Learning for Small Code Language Models
    Marwa Naïr, Kamel Yamani, Lynda Said Lhadj, Riyadh Baghdadi

Poster Papers

  • Feriji: A French-Zarma Parallel Corpus, Glossary & Translator
    Mamadou K. Keita, Elysabhete Amadou Ibrahim, Habibatou Abdoulaye Alfari, Christopher M Homan
  • Pragmatic inference of scalar implicature by LLMs
    Ye-eun Cho, Seong mook Kim
  • Topic Modeling for Short Texts with Large Language Models
    Tomoki Doi, Masaru Isonuma, Hitomi Yanaka
  • Can LLMs substitute SQL? Comparing Resource Utilization of Querying LLMs versus Traditional Relational Databases
    Xiang Zhang, Khatoon Khedri, Reza Rawassizadeh
  • Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer
    Yongqi Wang, Bai Jionghao, Rongjie Huang, Ruiqi Li, Zhiqing Hong, Zhou Zhao
  • MoExtend: Tuning New Experts for Modality and Task Extension
    Shanshan Zhong, Shanghua Gao, Zhongzhan Huang, Wushao Wen, Marinka Zitnik, Pan Zhou
  • On the Interpretability of Deep Learning Models for Collaborative Argumentation Analysis in Classrooms
    Dliang Wang, Gaowei Chen
  • Document Alignment based on Overlapping Fixed-Length Segments
    Xiaotian Wang, Takehito Utsuro, Masaaki Nagata
  • Automatically Suggesting Diverse Example Sentences for L2 Japanese Learners Using Pre-Trained Language Models
    Enrico Benedetti, Akiko Aizawa, Florian Boudin
  • Z-coref: Thai Coreference and Zero Pronoun Resolution
    Poomphob Suwannapichat, Sansiri Tarnpradab, Santitham Prom-on
  • ReMAG-KR: Retrieval and Medically Assisted Generation with Knowledge Reduction for Medical Question Answering
    Sidhaarth Murali, Sowmya Kamath, Supreetha R
  • Demystifying Instruction Mixing for Fine-tuning Large Language Models
    Renxi Wang, Haonan Li, Minghao Wu, Yuxia Wang, Xudong Han, Chiyu Zhang, Timothy Baldwin
  • Fine-Tuning ASR models for Very Low-Resource Languages: A Study on Mvskoke
    Julia Mainzinger, Gina-Anne Levow
  • Automating Qualitative Data Analysis with Large Language Models
    Angelina Parfenova, Alexander Denzler, Juergen Pfeffer
  • ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection
    Janek Herrlein, Chia-Chien Hung, Goran Glavaš
  • Label-Aware Automatic Verbalizer for Few-Shot Text Classification in Mid-To-Low Resource Languages
    Thanakorn Thaminkaew, Piyawat Lertvittayakumjorn, Peerapon Vateekul
  • Vector Spaces for Quantifying Disparity of Multiword Expressions in Annotated Text
    Louis Estève, Agata Savary, Thomas Lavergne
  • Narratives at Conflict: Computational Analysis of News Framing in Multilingual Disinformation Campaigns
    Antonina Sinelnik, Dirk Hovy
  • Assessing In-context Learning and Fine-tuning for Topic Classification of German Web Data
    Julian Schelb, Andreas Spitz, Roberto Ulloa
  • Knowledge Editing of Large Language Models Unconstrained by Word Order
    Ryoma Ishigaki, Jundai Suzuki, Masaki Shuzo, Eisaku Maeda
  • Exploring the Effectiveness and Consistency of Task Selection in Intermediate-Task Transfer Learning
    Pin-Jie Lin, Miaoran Zhang, Marius Mosbach, Dietrich Klakow
  • Does the structure of textual content have an impact on language models for automatic summarization?
    Eve Sauvage, Sabrina Campano, Lydia Ould-Ouali, Cyril Grouin
  • Action Inference for Destination Prediction in Vision-and-Language Navigation
    Anirudh Reddy Kondapally, Kentaro Yamada, Hitomi Yanaka
  • A Computational Analysis and Exploration of Linguistic Borrowings in French Rap Lyrics
    Lucas Zurbuchen, Rob Voigt
  • On Improving Repository-Level Code QA for Large Language Models
    Jan Strich, Florian Schneider, Irina Nikishina, Chris Biemann
  • Compromesso! Italian Many-Shot Jailbreaks undermine the safety of Large Language Models
    Fabio Pernisi, Dirk Hovy, Paul Röttger
  • Foundation Model for Biomedical Graphs: Integrating Knowledge Graphs and Protein Structures to Large Language Models
    Yunsoo Kim
  • ViMedAQA: A Vietnamese Medical Abstractive Question-Answering Dataset and Findings of Large Language Model
    Minh-Nam Tran, Phu-Vinh Nguyen, Long HB Nguyen, Dien Dinh
  • Basreh or Basra? Geoparsing Historical Locations in the Svoboda Diaries
    Jolie Zhou, Camille Lyans Cole, Annie T. Chen
  • Homophone2Vec: Embedding Space Analysis for Empirical Evaluation of Phonological and Semantic Similarity
    Sophie Wu, Anita Zheng, Joey Chuang
  • Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition
    Tyler McDonald, Ali Emami
  • Can LLMs Augment Low-Resource Reading Comprehension Datasets? Opportunities and Challenges
    Vinay Samuel, Houda Aynaou, Arijit Ghosh Chowdhury, Karthik Venkat Ramanan, Aman Chadha
  • Automatic Derivation of Semantic Representations for Thai Serial Verb Constructions: A Grammar-Based Approach
    Vipasha Bansal
  • Seed-Free Synthetic Data Generation Framework for Instruction-Tuning LLMs: A Case Study in Thai
    Parinthapat Pengpun, Can Udomcharoenchaikit, Weerayut Buaphet, Peerat Limkonchotiwat
  • Bridging Distribution Gap via Semantic Rewriting with LLMs to Enhance OOD Robustness
    Manas Madine
  • CoVoSwitch: Machine Translation of Synthetic Code-Switched Text Based on Intonation Units
    Yeeun Kang
  • An Analysis under a Unified Formulation of Learning Algorithms with Output Constraints
    Mooho Song, Jay-Yoon Lee
  • Beyond Abstracts: A New Dataset, Prompt Design Strategy and Method for Biomedical Synthesis Generation
    James O’Doherty, Cian Nolan, Yufang Hou, Anya Belz
  • Improving Sentence Embeddings with Automatic Generation of Training Data Using Few-shot Examples
    Soma Sato, Hayato Tsukagoshi, Ryohei Sasano, Koichi Takeda
  • Question-Analysis Prompting Improves LLM Performance in Reasoning Tasks
    Dharunish Yugeswardeenoo, Kevin Zhu, Sean O’Brien
  • An Individualized News Affective Response Dataset
    Tiancheng Hu, Nigel Collier
  • How Well Do Vision Models Encode Diagram Attributes?
    Haruto Yoshida, Keito Kudo, Yoichi Aoki, Ryota Tanaka, Itsumi Saito, Keisuke Sakaguchi, Kentaro Inui
  • CheckersGPT: Learning World Models through Language Modeling
    Abhinav Joshi, Vaibhav Sharma, Ashutosh Modi
  • In-Context Symbolic Regression: Leveraging Large Language Models for Function Discovery
    Matteo Merler, Katsiaryna Haitsiukevich, Nicola Dainese, Pekka Marttinen
  • STEP: Staged Parameter-Efficient Pre-training for Large Language Models
    Kazuki Yano, Takumi Ito, Jun Suzuki