Spotlights
- Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs
Koshiro Saito, Sakae Mizuki, Masanari Ohi, Taishi Nakamura, Taihei Shiotani, Koki Maeda, Youmi Ma, Kakeru Hattori, Kazuki Fujii, Takumi Okamoto, Shigeki Ishida, Hiroya Takamura, Rio Yokota, Naoaki Okazaki - Beyond the Final Layer: Intermediate Representations Improve Multilingual Calibration
Ej Zhou, Caiqi Zhang, Tiancheng Hu, Chengzu Li, Nigel Collier, Ivan Vulić, Anna Korhonen - The Case of Spanish as a Pluricentric Language: Challenging the Monolingual Bias in NLP to Improve Cultural Adequacy of LLMs
María Grandury, Diana Galvan-Sosa
Conference Track
Workshop Track
- Bias Beyond English: Evaluating Social Bias and Debiasing Methods in a Low-Resource Setting
Ej Zhou, Weiming Lu - Unlocking Medieval Texts: How Large Language Models Transform POS Tagging for Historical Romance Languages
Matthias Schöffel, Esteban Garces Arias - Kowen: Training a Strong Bilingual LLM through Synthetic Data
Noah Lee, Jiwoo Hong, Rodrigo Martínez-Castaño, César Rodríguez - Toxicity-Aware Few-Shot Prompting for Low-Resource Singlish Translation
Ziyu Ge, Gabriel Chua, Leanne Tan, Roy Ka-Wei Lee - Disparities in LLM Accuracy and Reasoning: A Case Study on African American English
Runtao Zhou, Guangya Wan, Saadia Gabriel, Sheng Li, Alexander J Gates, Maarten Sap, Thomas Hartvigsen - Sarc7: Evaluating Sarcasm Detection and Generation with Seven Types and Emotion-Informed Techniques
Lang Xiong, Raina Gao, Alyssa Jeong, Yicheng Fu, Kevin Zhu, Sean O’Brien, Vasu Sharma - Black LLMirror: User (Self) Perceptions in Black American English Interactions with LLMs
Mikayla Campbell, Maarten Sap, Mark Diaz, Joel Mire, Daniel Chechelnitsky - Breaking mBad! Supervised Fine-tuning for Cross-Lingual Detoxification
Himanshu Beniwal, Youngwoo Kim, Maarten Sap, Soham Dan, Thomas Hartvigsen - Improving Multilingual Language Models by Aligning Representations through Steering
Omar Mahmoud, Buddhika Laknath Semage, Thommen George Karimpanal, Santu Rana - CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical Distillation
Deepon Halder, Thanmay Jayakumar, Raj Dabre - Continually Adding New Languages to Multilingual Language Models
Abraham Toluwase Owodunni, Sachin Kumar - Redteaming Leading Arabic LLMs with ASAS
Haidar Khan, Abdalghani Abujabal, M Saiful Bari, Fidaa Abed, Babar Khalid Khan - Social Norms in Cinema: A Cross-Cultural Analysis of Shame, Pride and Prejudice
Sunny Rai, Khushang Zaveri, Shreya Havaldar, Soumna Nema, Lyle Ungar, Sharath Chandra Guntuku - Can You Hear Naples? Building and Benchmarking a Neapolitan Speech Corpus
Michael Cacioli, Liam Eggleston, Jatin Sarabu, Ivory Yang, Kevin Zhu - Cross-Lingual Transfer Does Not Implicitly Occur by Jointly Pretraining on Multilingual Data
Thanmay Jayakumar, Anoop Kunchukuttan, Raj Dabre - Mark My Words: A Robust Multilingual Model for Punctuation in Text and Speech Transcripts
Sidharth Pulipaka, Ashwin Sankar, Sparsh Jain, Raj Dabre - Cross-Lingual Gender Bias in LLMs through Workplace Scenario Simulations
Aryan Gulati